In a seminar which I attended as a new graduate student in 1966-67, the late Dennis 
Sciama said that when he himself had started work in cosmology in 1950 there was only 
one known fact about the Universe - and that fact later turned out to be wrong! The 
fact was that the Universe was expanding, which was known from Hubble’s law relating 
the magnitudes and redshifts of galaxies, and what was wrong with it was the expansion 
rate. Hubble’s data led, in the simplest interpretation, to a Universe younger than the 
geologically-known age of the Earth. 

Redshift z can be interpreted as due to the Doppler effect, giving a velocity of 
recession, while magnitude m gives a measure of distance (how, and the units and order 
of magnitude of the distances, are discussed below in Section 3.2). It was the distance 
scale in Hubble’s law that was in error. In the 1950s, and subsequently, that error was 
corrected, initially by a factor about 2.5 but now by a cumulative factor of order 7. 
Thus from 1952 onwards the Friedman-Lemaitre-Robertson-Walker (FLRW) expanding 
universes became acceptable models for the real Universe though it was not until the late 
1960s that they became clearly the dominant paradigm. North’s (1965) philosophical 
and Ellis’s (1989) historical account, which includes an annotated bibliography, give 
more detailed background up to about 1960, and many more references to original 
papers and secondary sources than there is space for here: see also Bondi (1960). 

The incorrect distance scale was of less importance than the revolution in 
humanity’s picture of the Universe that the inferred expansion implied. It was obvious 
the heavens were not static, since solar system bodies described forms of periodic motion, 
and it was also known they did change, as shown e.g. by the Chinese observations 
of the supernova that formed the Crab nebula, and by the comet of 1577 for which 
Tycho measured a parallax, demonstrating that it was moving through the zone of the 
planets. Nevertheless, from Aristotle onwards, much philosophical and religious thought 
considered the “fixed stars” to live up to their name, so models of the large-scale universe 
before Hubble’s work were generally static: such a picture was also supported, though 
not unambiguously, by the available observations. 

Despite the problem of the discordant ages, during the half-century after Hubble’s 
paper, expansion became fundamental to understanding the nature and evolution of the 
matter in the Universe, in particular: the formation of the chemical elements, from the 
combined effect of Big Bang and stellar nucleosyntheses; the resulting inference of the 
existence of at most 5, and probably 3, types of neutrino; and the prediction of the 
Cosmic Microwave Background (CMB). 

By 1980 there was therefore a well-established standard model, or rather a set of 
models, FLRW universes containing pressureless matter (“dust”) and radiation, which 
agreed with all the principal features of the observed Universe as then known - except 
one. The most obvious fact about the Universe is that its density is not uniform - it 
is lumpy - and within the models of the time this was explicable only as coming from 
(rather unnatural) primordial irregularities. 

Inflation theory, our current best explanation for the lumpiness, was introduced in 
the 1980s. The generation of the necessary fluctuations during a period of inflation, 
when the universe is driven by an unknown field named the “inflaton” and is expanding 
very rapidly, is now part of the cosmological standard model, the “concordance model”. 
It involves quantum fields in the early universe which produce a fluctuation spectrum, 
at the end of the inflationary period, that matches the initial conditions required for a 
subsequent classical evolution which gives the observed structures and phenomena. The 
theory of inflation is discussed in the companion Milestone review, Durrer (2015). 

The concordance model gives predictions for the spectrum of variations in the CMB 
and their evolution. These predictions depend on the behaviour of quantum and classical 
fields in an expanding universe, and the evolution of perturbations during expansion. 
The link between the spectrum of fluctuations at the end of inflation and the present-day 
density variations is provided by the (relativistic) theory of classical perturbations 
of the expanding FLRW models. The resulting explanation of the density variations 
thus represented an additional success for Einstein and Hubble. 

There was still a surprise to come. In the late 1990s two groups announced, on the 
basis of the magnitude-redshift relation for supernovae of type Ia (i.e. still using the 
same principles as Hubble’s study) that the Universe’s expansion was accelerating. This 
evidence, coupled in particular with the CMB observations and the “baryon acoustic 
oscillations” (BAO) found in galaxy redshift surveys, led to our current picture in which 
the total energy density of the Universe is close to the critical density, the boundary 
between ever-expanding and contracting models, and made up of under 5% visible 
matter, about 25% “dark matter” and about 70% “dark energy”. 

Additional evidence is, or is expected to be, available from many types of 
observation such as gravitational lensing studies (using another consequence of Einstein’s 
theory: see Will (2015)), gravitational wave detection, observations of individual 
galaxies, and terrestrial dark matter experiments, as well as from refinements of the 
CMB, BAO and SNIa data. 

It is interesting that although the results from the SNIa measurements were 
unexpected, they could be regarded as theoretically predicted in that it had been shown 
to be necessary to add a substantial $\Lambda$ to the perturbed FLRW models to get agreement 
with observation (Efstathiou et al. 1990, Ostriker & Steinhardt 1995). 

There are still big open questions about the expanding universe, the most obvious 
being the natures of the inflaton (the cause of inflation), of dark matter and of dark 
energy. 

This review will discuss the above points and aim to provide clear indications of 
the fundamental importance of both relativity and the Universe’s expansion to our 
understanding of cosmology. 

2.1. Observations preceding Hubble’s 

For metaphysical reasons many people have had a strong bias towards a static and 
unchanging Universe, albeit one including growth and/or change in individual lives, in 
the motion of Solar System bodies, and so on. Such views lay behind the development 
of the Steady State theory (Bondi 1960). However, it is important to realise that there 
were also observational reasons for astronomers to favour a static universe. 

The readily visible stars in the Milky Way, our Galaxy, occupy a rather irregular 
but roughly coin-shaped region of what we now understand to be the disk of a spiral 
galaxy. In Herschel’s universe, from 1785, these stars were taken to be the whole matter 
content of the Universe, beyond which there was only empty space; and the Sun was at 
the Universe’s centre. We now know that this incomplete picture arose because stars 
further away in the spiral arms of the galaxy, and its bulge, are obscured from observation 
in visible wavebands due to gas and dust. (The modern spectacular observations (Eckart 
& Genzel 1996, Ghez et al. 2008) supporting the presence of a supermassive black hole 
at the centre of the Galaxy are made in the infrared.) The extensive work summarized 
in Kapteyn (1922) led to an ellipsoidal model 3 kpc thick and of radius 15 kpc, with the 
Sun near the centre (1 parsec, a pc, is $3.1 \times 10^{13}$ km = 3.26 light-years; see section 3.2). 

The alternative “Island Universe” concept, that the nebulae were other star systems 
like the Galaxy, the viewpoint that Herschel had hoped to confirm, was introduced by 
eighteenth century astronomers and philosophers. It was supported by the nineteenth 
century resolution of some nebulae into stars (see North (1965)), but was the less- 
favoured option, by most astronomers, until the 1920s. The arguments against concerned 
the relative sizes of the Galaxy and the nebulae, and nebular spectra (an argument in 
which very different types of nebulae were conflated). 

The two decades leading up to Hubble’s announcement saw a great deal of work 
on distant stars and nebulae, covering the discovery of the shape and size of our own 
Galaxy and the first redshifts and distances of extragalactic nebulae. The numbers of 
references in the papers that are cited in the following summary show how much more 
was going on. 

Slipher (1914) had noted that the shapes of spectral lines from spiral nebulae† 
implied that those nebulae rotated. Van Maanen, initially in measurements of M101 
(van Maanen 1916), alleged, on the basis of attempts to measure proper motions, that 
the rotation periods were of the order of 85,000 years. (For a full account of van 
Maanen’s work, and references, see Hetherington (1972).) This could only make sense 
if those nebulae were within the Galaxy. A similar inference was drawn from the 1885 
observation of a supernova in M31, misidentified as merely a nova (it was later used to 
infer a distance ~ 200 kpc for M31 by Lundmark (1919); although this was a significant 
underestimate it sufficed to show that M31 was outside the Galaxy). That inference 
helped to delay the recognition of the true nature of the other galaxies in the Universe. 

† “Nebulae” means clouds, in Latin: the distant agglomerations of stars looked like clouds in the 
telescopes of the time. 

These viewpoints began to change due to other observations made from the 1910s 
onwards. In 1912 Slipher became the first to measure the redshift of an extragalactic 
nebula (Slipher 1913), and found that M31, the Andromeda nebula, approaches us at 
about 300 km/s. (It can be argued from this and his subsequent papers that Slipher 
deserves a large part of Hubble’s credit for finding the expansion (Peacock 2013).) 

Shapley (1918) measured the distances to a number of globular clusters of stars and 
showed that they fill a sphere centred at the Galactic centre, giving the first indication 
of the true shape and scale of the Galaxy and the Sun’s position within it. Globular 
clusters are visible out of the plane of the Galaxy, but the absorption still present led 
Shapley to overestimate the distance of the Galactic centre by a factor 2 (his figure was 
20 kpc). 

The two views on the nature of the nebulae gave rise to a “Great Debate” between 
Shapley and Curtis in 1920 (see Trimble (1995)), Shapley arguing that the Galaxy was 
the whole Universe, and Curtis arguing that the nebulae were “island universes”, on 
the grounds of the redshifts, the occurrence of dark regions like the dust clouds in the 
Galaxy, and the rates of novae (Trimble discusses 14 points of argument in total). The 
argument continued during the 1920s, during which the balance shifted in favour of the 
nebulae being extragalactic. 

Oort et al. (1924) showed that there was a halo of stars round the Galaxy occupying 
the same sphere as Shapley’s globular clusters. Further confirmation of the size of the 
Galaxy came from the work of Lindblad (1927) and Oort (1927), who, in a series of 
papers, proposed and observationally verified the differential rotation of the Galaxy and 
thus its scale and the motion and position of the Sun within it. This led to an estimate 
of 10 kpc for our distance from the Galactic centre§. Hence the Sun could no longer be 
considered the centre of the Universe. (The work by Trumpler (1930a, 1930b) directly 
measuring obscuration, and thus showing how the apparent discrepancy over the size of 
the Galaxy arose, came after Hubble’s announcement.) 

Slipher was building up the catalogue of known red- and blue-shifts of nebulae. By 
1917 he had 25, only 4 of them blueshifts (Slipher 1917) and Eddington (1923) was able 
to use 41, including 5 blueshifts. 

Hubble (1925a, 1925b) obtained distances to M31, M33 and NGC 6822 calibrated 
by observing variable stars (following some work by Duncan), which he identified as 
Cepheids. (Interestingly, Hubble obtained a less good estimate of the distance to 
M31 than had been obtained by Curtis using observations of novae: see e.g. Steer 
(2011) for references to this and other early estimates.) In the first of these papers 
he noted that such stars had already been detected in 3 more galaxies. He estimated 
the magnitude of M31 as -21.8, corresponding to a distance of 285 kpc. In Hubble 
(1926), which described his classification of nebulae, a somewhat controversial matter 
(Christianson 1995, Chapter 8), he gives distances to 32 galaxies and shows a relation 
between their absolute magnitudes and that of their brightest stars. The continuation 
of this work culminated in his 1929 paper. 

§ Accurate measures of this distance are still difficult. The current conventional value is 8.5 kpc. 

It has been argued in retrospect that we should have realised there was expansion, 
because it could explain the fact that the sky is dark at night even though in an infinite 
static universe every line of sight should end on a star and so be bright (Olbers’ paradox). 
This argument has been addressed and dismissed by Harrison (1981, chapter 12). He 
points out (a) that foreground stars obscure background ones, so only a finite number of 
stars could be seen and (b) with any reasonable lifetimes, stars cannot provide enough 
energy: a calculation shows that the “paradox” requires the most distant contributors 
to the light to be at $10^{23}$ light years. 

2.2. The theoretical developments: FLRW models 

The field equations of the theory of general relativity (GR) can be written as 

$$G_{ab} \equiv R_{ab} - \tfrac{1}{2} R g_{ab} = \kappa T_{ab} - \Lambda g_{ab} \tag{1}$$

relating the “Einstein tensor” $G_{ab}$ of a pseudo-Riemannian spacetime to its 
energy-momentum content $T_{ab}$. The conventions and definitions used above are set out in 
the next few paragraphs. Here $\kappa = 8\pi G/c^4$, where $G$ is the Newtonian constant of 
gravitation and $c$ the speed of light, in order to agree with Newtonian gravity in an 
appropriate limit, and $\Lambda$ is the cosmological constant. Einstein’s initial version of GR 
(Einstein 1915) did not include the cosmological constant. He added it in Einstein 
(1917) precisely in order to have a static model of the Universe. 

The spacetime has a metric $g_{ab}$ of signature ±2 (the sign choice is conventional), 
defining the scalar product of two tangent vectors $v$ and $w$ at a point $p$ to be $g_{ab}(p) v^a w^b$. 
The vectors’ components here are given, in terms of some suitable choice of basis vectors, 
$\{e_a\}$ ($a = 1, 2, 3, 4$), by $v = v^a e_a$ for a vector $v$. Under gravity alone, test particles 
move on the geodesics of this metric. 

The formulae relating the metric, the connection $\Gamma^a{}_{bc}$ and the Riemannian 
curvature, in coordinate components where $e_a = \partial/\partial x^a$, are 

$$\Gamma^a{}_{bc} = \tfrac{1}{2} g^{ad} (g_{bd,c} + g_{dc,b} - g_{bc,d}), \tag{2}$$

$$R^a{}_{bcd} = \Gamma^a{}_{bd,c} - \Gamma^a{}_{bc,d} + \Gamma^e{}_{bd}\Gamma^a{}_{ec} - \Gamma^e{}_{bc}\Gamma^a{}_{ed}, \tag{3}$$

where $g^{ad}$ is the inverse of $g_{bc}$. Here the subscript $,b$ denotes a partial derivative in the 
$e_b$ direction, while $;b$ will similarly denote a covariant derivative. 

The energy-momentum tensor of the matter content, $T_{ab}$, is assumed to obey 
$T^{ab}{}_{;b} = 0$; this generalizes the usual conservation laws to the curved spacetime. 

The FLRW models are based on the Robertson-Walker metric, which can be written 
as 

$$ds^2 = a^2(t)[dr^2 + \Sigma^2(r, K)(d\theta^2 + \sin^2\theta\, d\varphi^2)] - dt^2, \tag{4}$$

where $K$ can be normalized (by re-scaling as needed) to 1, 0 or $-1$. $K$ characterizes 
the three possible curvatures of the hypersurface $t$ = constant and $\Sigma(r, K)$ = 
$\sin r$, $r$ or $\sinh r$ respectively. The notation $a$ here was used by Landau & Lifshitz 
(1941) and Lifshitz (1946) and followed by Kramer et al. (1980). It has become generally 
adopted. Other older literature used instead $R$ for radius, $S$ for scale factor, or $\ell$ for 
length scale. 

This form is spherically symmetric about any point, and is homogeneous on each 
t = constant hypersurface. Robertson (1935, 1936) and Walker (1935) independently 
showed that any spacetime spherically symmetric about every point had to be spatially 
homogeneous and admit this form of the metric. That is a purely geometric result, and 
thus the form (4) is commonly used in modified gravity theories as well as in GR. 

The RW part of the FLRW name is thus slightly anachronistic in a discussion of 
the development of the GR models before 1929. However, by 1929 quite a number of 
solutions of (1) using (4) were known, the most important contributions to the expanding 
models being those of Friedman and Lemaitre discussed below: hence the name we now 
use. (Those exact solutions now known are summarized in Chapter 14 of Stephani et al. 
(2003) and references therein.) 

For the metric (4), the Einstein equations (1) necessarily imply that the 
energy-momentum has the perfect fluid form‖ 

$$T_{ab} = \mu u_a u_b + p h_{ab}, \qquad h_{ab} := g_{ab} + u_a u_b, \tag{5}$$

where $\mu$ is the energy density, $p$ the pressure and $u^a$ a unit four-velocity; in (4) $u^a$ is 
orthogonal to $t$ = constant. Assuming $\dot{a} \neq 0$, the equations (1) reduce to 

$$3\dot{a}^2 = \kappa\mu a^2 + \Lambda a^2 - 3K, \tag{6}$$

$$\dot{\mu} + 3(\mu + p)\dot{a}/a = 0. \tag{7}$$

In honour of Hubble, $\dot{a}/a$ is denoted $H$. 

Einstein (1917) gave the Einstein static universe in which $K = 1$ and $\dot{a} = 0$. When 
$\dot{a} = 0$, $\mu$ is constant and one has to add the equation 

$$\kappa(\mu + 3p) = 2\Lambda$$

to (6)-(7). It should be noted that Einstein’s static solution was a radical departure both 
from observation and the two usual models of the day, Herschel’s and Island Universes, 
in that it assumed the Universe was uniform and isotropic in space. It also introduced 
the description of the matter content as a fluid, widely used since. 
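
The origin of the extra static condition can be sketched by differentiating (6) and eliminating $\dot{\mu}$ with (7). The lines below are a brief illustration of that standard step using only the symbols of (6) and (7); they are not taken from the papers cited.

```latex
% Differentiate (6) with respect to t:
6\dot{a}\ddot{a} = \kappa\dot{\mu}a^2 + 2\kappa\mu a\dot{a} + 2\Lambda a\dot{a} .
% Substitute \dot{\mu} = -3(\mu + p)\dot{a}/a from (7), then divide by 2a\dot{a}
% (allowed while \dot{a} \neq 0):
3\ddot{a}/a = -\tfrac{1}{2}\kappa(\mu + 3p) + \Lambda .
% A static model needs \ddot{a} = 0 as well as \dot{a} = 0, so
\kappa(\mu + 3p) = 2\Lambda .
% For dust (p = 0) this gives \kappa\mu = 2\Lambda, and (6) with K = 1 then gives
a^2 = 1/\Lambda .
```

When $\dot{a} = 0$ identically, (6) and (7) no longer imply the second-order equation, which is why it has to be imposed separately.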

The other solutions with $\dot{a} = 0$ are forms of the empty spaces of constant curvature 
- flat space, de Sitter space ($\Lambda > 0$) and anti-de Sitter space ($\Lambda < 0$). It was soon after 
Einstein’s paper that de Sitter (1917a, 1917b) found his eponymous metric, which he 
gave in several sets of coordinates including (in amended notation and units and with 
the opposite sign convention) 

$$ds^2 = -\cos^2(r/R)\,dt^2 + dr^2 + R^2\sin^2(r/R)(d\theta^2 + \sin^2\theta\, d\varphi^2). \tag{8}$$

This solution has $\mu = 0 = p$, and $\Lambda = 3/R^2$. Note that de Sitter did not give this in the 
form (4), i.e. not in coordinates referred to an expanding congruence. If that is done 
the metric can be written as 

$$ds^2 = -dt^2 + e^{Ht}(d\psi^2 + \cos^2\psi\,[d\theta^2 + \sin^2\theta\, d\varphi^2]), \tag{9}$$

a form first found by Lemaitre and by Robertson. 

‖ As stated, this form assumes the units are chosen so that $c = 1$: for normal units one must replace 
$p$ by $p/c^2$. 

de Sitter was aware that there are redshift effects in this metric. The exact form 
of those effects depends on which congruences of emitters and observers are being 
considered, and some confusion arose in early years because authors were not always 
careful to distinguish between the possibilities. Underlying the ambiguity is the fact 
that the de Sitter metric is four-dimensionally homogeneous and therefore there is no 
naturally preferred set of worldlines or observers. 

The principal choices were (a) observers on worldlines static in the metric (8), 
requiring some non-gravitational force to remain in their positions, (b) freely-falling 
worldlines viewed by observers static in (8), and (c) the expanding congruence used 
in (9). For (a) there is just gravitational redshift, as for static observers in the 
Schwarzschild metric, for (b) the Doppler shift of the emitters relative to the observers 
has to be added, and for (c) the redshifts come just from the expansion. The three 
approaches lead to different magnitude-redshift relations. Lemaitre, Weyl and others 
noted there would be a linear velocity-distance relation in case (c). 

de Sitter (1917a) called the Einstein static solution “system A”, and (8) “system B”, 
names that remained in common use until after 1929, and noted three redshifts (from 
Slipher and others), using them to infer distances assuming the galaxies were static in 
system B. Thus de Sitter’s solution prompted the first analyses of the redshift data, by 
several authors, in terms of a theoretical model. For example, Eddington (1923) used 36 
redshifts and 5 blueshifts, mainly obtained from and by Slipher, some of them otherwise 
unpublished at the time, while Lundmark (1924) assumed that galaxies are standard 
objects, and deduced distances in units of the M31 distance. Lundmark then showed 
a degree of correlation between these distances and Slipher’s redshifts, but with large 
scatter, and did not interpret the results as showing an expanding universe. Although 
there were no reliable distances, some authors noted a linearity between velocity and 
distance. 

Weyl’s contribution is particularly notable in that he obtained a general formula 
for redshift in any model and, in the 1923 fifth edition of his book¶, argued for a 
non-stationary model, considered an expanding region within de Sitter space, and proposed 
a distance scale corresponding to an $H$ of 103 km/s/Mpc (Ehlers 2009). 

In 1922 and 1924, Friedman+ published his expanding universe models with positive 
(Friedman 1922) and negative (Friedmann 1924) spatial curvature*. (6) is thus called 
the Friedman equation. The matter content of these models was dust (so $p = 0$), 
formed by the energy-momentum of the “gas” of galaxies, taken to be pressureless since 
collisions are infrequent. (When discussing FLRW models the dust constituent of the 
energy-momentum content is sometimes just called matter.) It is a curiosity of history 
that the zero curvature counterpart of the Friedman models, the Einstein-de Sitter 
model, was found only in Einstein & de Sitter (1932), although Robertson (1929) had 
written down the general equations for all three $K$ values. 

¶ According to Ehlers (2009), this edition has not been translated. 

+ Here I use the transliteration on his 1922 paper, which he used in later life. (I am grateful to Michael 
Heller for this information.) It is the more correct English transliteration from the original Russian. 
The commonly-used version, Friedmann, a German transliteration, appears on the 1924 paper. 

* Both these papers have been translated and reprinted as Friedman (1999). 

Lemaitre (1927) discussed the more general FLRW case with both dust and 
“radiation” (a fluid obeying $p = \mu/3$, which arises naturally from averaging over an 
isotropic distribution of particles moving at the speed $c$, e.g. photons), as well as $\Lambda$. It 
is this paper, together with Friedman’s two, which justifies the FL part of the FLRW 
name. Lemaitre set out to find a model with nonzero matter content and a set of 
expanding worldlines. He chose $K = 1$, in order to have finite spatial extent, and 
assumed the elliptic topology‡‡. 

Lemaitre then found the dynamic solution, later entitled the Eddington-Lemaitre 
solution, which tends to an Einstein static universe as $t \to -\infty$ and to a de Sitter 
universe as $t \to \infty$ (this is presented graphically in Figure 2). Luminet’s commentary 
on the reprint of this work (see Lemaitre (2013)) describes the evidence that Lemaitre 
actually calculated the behaviours of all the K = 1 models, although he was apparently 
unaware of Friedman’s earlier work until 1929 when Einstein told him about it. 

Lemaitre also derived the redshift formula for light observed by an astronomer A 
from a source G in an FLRW (or just RW) model, where A and G are assumed to be 
at constant spatial coordinates in (4): 

$$1 + z = a_A/a_G. \tag{10}$$

Apart from Weyl’s book (see above), this was the first time redshift had been related to 
an expansion of the universe rather than an (apparent or real) motion of galaxies within 
a static spacetime. Lemaitre then related this formula to the known astronomical data 
(see the next section). 
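
To see how (10) leads to a linear law at small redshift, one can expand the scale factor about the time of observation. The sketch below assumes a light travel time $\Delta t$ that is short compared with $1/H$, and units with $c = 1$; the symbols $\Delta t$ and $d$ are introduced here purely for illustration.

```latex
% Light emitted at t_A - \Delta t is received at t_A:
a_G = a(t_A - \Delta t) \approx a_A (1 - H\,\Delta t), \qquad H = \dot{a}/a \ \text{evaluated at}\ t_A .
% Inserting this in (10):
1 + z = a_A/a_G \approx 1 + H\,\Delta t
\quad\Longrightarrow\quad z \approx H\,\Delta t \approx H d ,
% where d \approx \Delta t is the distance travelled by the light (c = 1).
```

To this order the redshift is proportional to distance, with $H$ as the constant of proportionality.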

Lemaitre’s $\mu$ incorporated conservation of a total mass $M$. (7) is the remaining 
non-trivial Bianchi identity for the metric (4), and governs the evolution of the matter 
content. Even today its major constituent is assumed to be dust, representing the 
visible galaxies and invisible cold dark matter, CDM††. Here cold means the matter’s 
constituents have small kinetic energies compared with rest mass, and thus exert only 
negligible pressure. The matter content also includes the CMB, but this has a much 
smaller density now than the dust. In a Big Bang model the dust and radiation had 
equal densities at $t_{\rm eq}$ after the bang: $t_{\rm eq}$ is of the order of $10^4$ yr. 

In an FLRW model, dust has a density $\mu_d = M_d/a^3$, where $M_d$ is a constant, 
and “radiation” similarly has $\mu_r = M_r/a^4$. Lemaitre thus had (in this notation) 
$\mu = M_d/a^3 + M_r/a^4$, $p = M_r/3a^4$, but noted that $M_r$ could be neglected when 
considering the application to astronomy. 

‡‡ This topology choice makes no difference to the dynamics. 

†† This name amuses British scientists because it was the abbreviation for the most popular UK 
chocolate brand. 

We define the density parameter $\Omega_m := \kappa\mu/3H^2$, and similarly $\Omega_d$, $\Omega_r$, $\Omega_\Lambda = \Lambda/3H^2$ 
and $\Omega_K = -K/a^2H^2$. Note that necessarily (from (6)) $\Omega_{\rm total} = \Omega_m + \Omega_\Lambda + \Omega_K = 1$. At 
the end of inflation, conversion of the energy-momentum of the inflaton into radiation is 
assumed, giving $\Omega_r \approx 1$, with only very small contributions from $\Lambda$ and $K$. Because of 
their dependences on $a$, $\Omega_K/(\Omega_m + \Omega_\Lambda)$ will remain negligible during expansion unless 
$\Lambda = 0$. (A similar remark applies for the forms of dark energy other than $\Lambda$ that 
have been proposed.) That $\Omega_K/(\Omega_m + \Omega_\Lambda)$ is small can thus be regarded as a testable 
prediction of inflation. Inflation currently does not predict the present-day ratio of dark 
energy to dark and luminous matter. 
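
As an illustration of how the density parameters fix the expansion history, the following minimal sketch integrates the Friedman equation (6) numerically to get the age of a model in units of $1/H$. It assumes the scalings dust $\propto a^{-3}$, radiation $\propto a^{-4}$, constant $\Lambda$ and curvature $\propto a^{-2}$, and uses $t_0 = \int_0^1 da/(aH)$, which follows from $H = \dot{a}/a$. The function name, the midpoint rule and the sample parameter values are choices made here for illustration, not part of the original analysis.

```python
import math


def dimensionless_age(omega_d, omega_lambda, omega_r=0.0, steps=20_000):
    """Return H0 * t0 for an FLRW model by integrating da / (a * E(a)) on (0, 1).

    E(a) = H(a) / H0 follows from the Friedman equation (6); all density
    parameters are present-day values (a = 1 today).
    """
    # Curvature parameter fixed by the sum rule Omega_total = 1.
    omega_k = 1.0 - omega_d - omega_r - omega_lambda
    da = 1.0 / steps
    total = 0.0
    for i in range(steps):
        a = (i + 0.5) * da  # midpoint rule: never evaluates at a = 0
        e_squared = (omega_d / a**3 + omega_r / a**4
                     + omega_lambda + omega_k / a**2)
        total += da / (a * math.sqrt(e_squared))
    return total


if __name__ == "__main__":
    # Illustrative (assumed) parameter choices.
    print("dust only, K = 0      :", round(dimensionless_age(1.0, 0.0), 4))
    print("0.3 dust + 0.7 Lambda :", round(dimensionless_age(0.3, 0.7), 4))
```

For dust alone with $K = 0$ the integral can be done by hand and gives $H_0 t_0 = 2/3$, which is a convenient check on the numerics; adding a positive $\Lambda$ at fixed $H_0$ makes the model older.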

The deceleration parameter $q$ is defined by $q := -\ddot{a}/aH^2$, and obeys 

$$q = \tfrac{1}{2}(\Omega_m + \kappa p H^{-2}) - \Omega_\Lambda. \tag{11}$$

Note that the sign adopted in the definition of $q$ was related to the fact that if $\Lambda = 0$ a 
positive $q$ was to be expected. 

Many of the known solutions generalize the solutions for “dust” and “radiation” 
or a combination thereof, often assuming one or more constituents with a barotropic 
equation of state $p = p(\mu)$, which is frequently chosen to be of linear form, i.e. $p = w\mu$ 
where $w$ is a constant. That form for the overall $p$ leads to 

$$q = \tfrac{1}{2}\Omega_m(1 + 3w) - \Omega_\Lambda. \tag{12}$$

Note that $\Lambda$ is equivalent to such a barotropic fluid with $w = -1$, and that if $\Lambda = 0$, 
$w = -1/3$ is the critical value separating accelerating from decelerating universes. 
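
The behaviour of (12) is easy to explore numerically. The short sketch below evaluates $q$ for a few parameter choices; the function name and the sample values are assumptions made for illustration only.

```python
def deceleration_parameter(omega_m, omega_lambda, w=0.0):
    """Evaluate q = (1/2) * Omega_m * (1 + 3w) - Omega_Lambda, equation (12)."""
    return 0.5 * omega_m * (1.0 + 3.0 * w) - omega_lambda


if __name__ == "__main__":
    # (label, Omega_m, Omega_Lambda, w): illustrative values, not fitted ones.
    cases = [
        ("dust, Lambda = 0", 1.0, 0.0, 0.0),
        ("radiation, Lambda = 0", 1.0, 0.0, 1.0 / 3.0),
        ("critical w = -1/3", 1.0, 0.0, -1.0 / 3.0),
        ("0.3 dust + 0.7 Lambda", 0.3, 0.7, 0.0),
    ]
    for label, omega_m, omega_lambda, w in cases:
        q = deceleration_parameter(omega_m, omega_lambda, w)
        print(f"{label:24s} q = {q:+.3f}")
```

A negative $q$ corresponds to accelerating expansion, so with these sample numbers only the model containing $\Lambda$ accelerates.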

As Heller (1974) pointed out, we still lack any detailed modeling showing that 
the fluid approximation first used by Einstein is valid throughout the different phases 
of the Universe’s evolution. In particular, while in the early universe there are very 
large numbers of particles in small volumes, so the averaging usually implied by a fluid 
approximation (see e.g. Batchelor (1967), section 1.2) should be valid, it is less than 
clear that this can be smoothly carried over to a present day “gas” of galaxies, where 
averaging is over only small numbers of particles. It may be that the unknown nature 
of the cold dark matter now inferred to be present throughout the Universe is such that 
it resolves this issue. 

More recently (minimally coupled) scalar field solutions with a potential $V(\phi)$ have 
been widely studied: here the field $\phi$ obeys a field equation 

$$\phi^{;a}{}_{;a} = \mathrm{d}V/\mathrm{d}\phi$$

and in FLRW models gives an energy-momentum with 

$$\mu = \tfrac{1}{2}\dot{\phi}^2 + V(\phi), \qquad p = \tfrac{1}{2}\dot{\phi}^2 - V(\phi),$$

where the dot denotes d/dt. Such fields have in particular been used to model the 
inflaton, the dark matter, and the dark energy, and they may arise as effective fields in 
(e.g.) considerations of averaged inhomogeneities (Buchert et al. 2006). 
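
As a consistency check, the standard scalar-field energy density and pressure can be inserted into the conservation equation (7). The sketch below assumes a homogeneous field, $\phi = \phi(t)$, and the usual expressions $\mu = \tfrac{1}{2}\dot{\phi}^2 + V$ and $p = \tfrac{1}{2}\dot{\phi}^2 - V$; it is an illustration rather than a derivation taken from the works cited.

```latex
% Time derivative of the energy density, and the combination entering (7):
\dot{\mu} = \dot{\phi}\,(\ddot{\phi} + \mathrm{d}V/\mathrm{d}\phi), \qquad \mu + p = \dot{\phi}^2 .
% Equation (7), \dot{\mu} + 3(\mu + p)H = 0, then gives (for \dot{\phi} \neq 0)
\ddot{\phi} + 3H\dot{\phi} + \mathrm{d}V/\mathrm{d}\phi = 0 .
% Effective equation of state:
w = \frac{p}{\mu} = \frac{\tfrac{1}{2}\dot{\phi}^2 - V}{\tfrac{1}{2}\dot{\phi}^2 + V}
\;\to\; -1 \quad \text{when} \quad \dot{\phi}^2 \ll V .
```

A slowly varying field dominated by its potential therefore behaves approximately like $\Lambda$, which is why such fields are natural candidates for the inflaton and for dark energy.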

The perfect fluid form required by (4) can also be constructed from two or more 
forms of matter not individually having that form of energy momentum (see e.g. Coley 
& Tupper (1983)). 

2.3. Hubble’s 1929 paper 

As set out in the previous two sections, work on both the theoretical and observational 
bases for expanding models of the universe had gathered pace in the 1920s, and begun 
to interact, but Hubble’s 1929 paper was undoubtedly the turning point. 

In Lemaitre’s 1927 paper he, remarkably, used 42 galaxies, with redshifts obtained 
from Stromberg and apparent magnitudes from Hubble, to estimate the expansion rate. 
To do so he assumed, in the same manner as Lundmark (1924), that each galaxy has 
an absolute magnitude equal to the mean of those whose absolute magnitudes had been 
measured by Hubble (an assumption which inevitably increases scatter) and found the 
relation we call Hubble’s law. While this might suggest renaming it Lemaitre’s law, the 
results crucially depended on Hubble’s work (and Slipher’s), and only in Hubble’s work 
were good† individual galactic distances used. Perhaps the right assignment of credit is 
to Hubble for the observational facts and Lemaitre for the interpretation. 

It was thus Hubble’s (1929) paper which gave a firm observational basis for the 
linear relation between distance d and recession velocity v of galaxies, Hubble’s law 

v = Hd . (13) 

In the paper, Hubble plots the velocities and distances of 24 nebulae, with speeds up 
to about 1000 km/s (i.e. a redshift around 0.003). To obtain them he used the brightest 
star and Cepheid methods. The most distant four, which he identifies as members of 
the Virgo cluster, were assigned distances about 2 Mpc. (Taking a modern value of 
about 70 km/s/Mpc for H, these galaxies are in fact at a distance of about 14 Mpc.) 
He also estimated an average distance for a further 22 nebulae, using, as Lundmark and 
Lemaitre had, the mean absolute magnitude of the galaxies with measured distances 
and the measured apparent magnitudes, and plotted that point; his plot is Figure 1 
(which he labels by velocity and distance rather than z and m). 
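
The arithmetic behind these distances is simply (13) rearranged. The sketch below reproduces it for a recession speed of 1000 km/s, using Hubble's value of 500 km/s/Mpc and a modern value of about 70 km/s/Mpc; the function names are illustrative, and the redshift conversion assumes the low-speed Doppler relation $z \approx v/c$.

```python
C_KM_S = 299_792.458  # speed of light in km/s


def distance_mpc(velocity_km_s, hubble_km_s_mpc):
    """Distance in Mpc implied by Hubble's law v = H d."""
    return velocity_km_s / hubble_km_s_mpc


def low_redshift(velocity_km_s):
    """Approximate redshift z = v / c, valid only for v much less than c."""
    return velocity_km_s / C_KM_S


if __name__ == "__main__":
    v = 1000.0  # km/s, roughly the largest speed in Hubble's 1929 plot
    print(f"z = {low_redshift(v):.4f}")
    for h in (500.0, 70.0):
        print(f"H = {h:5.0f} km/s/Mpc  ->  d = {distance_mpc(v, h):5.1f} Mpc")
```

The ratio of the two distances, a factor of about 7, is the cumulative correction to Hubble's distance scale.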

The coefficient H in (13) (which Hubble himself denoted K) is known as the Hubble 
constant, although it varies with time in observationally viable FLRW models: Hubble 
estimated it as 500 km/s/Mpc. (He actually found H = 465 ± 50 km/s/Mpc from the 24 
galaxies and 513 ± 60 km/s/Mpc by treating them in 9 groups.) Lemaitre’s analysis had 
been done with and without a weighting intended to reduce the influence of the more 
distant galaxies on the result, as the observations of those galaxies seemed less reliable, 
and he found H = 575 (without the weighting) and H = 625 (with it), in km/s/Mpc. 
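
These values of $H$ translate directly into a characteristic timescale $1/H$, which in the simplest interpretation sets the scale of the age of the Universe. The following sketch converts $H$ in km/s/Mpc to $1/H$ in Gyr; the conversion constants are standard, while the function name and the list of cases are choices made here for illustration. Note that $1/H$ is only an estimate of the age, and the exact value depends on the model.

```python
KM_PER_MPC = 3.0857e19        # kilometres in one megaparsec
SECONDS_PER_GYR = 3.15576e16  # seconds in 10^9 Julian years


def hubble_time_gyr(hubble_km_s_mpc):
    """Return the Hubble time 1/H in Gyr for H given in km/s/Mpc."""
    return KM_PER_MPC / hubble_km_s_mpc / SECONDS_PER_GYR


if __name__ == "__main__":
    cases = [
        ("Hubble (1929)", 500.0),
        ("Lemaitre (1927), unweighted", 575.0),
        ("Lemaitre (1927), weighted", 625.0),
        ("modern value, about", 70.0),
    ]
    for label, h in cases:
        print(f"{label:28s} H = {h:5.0f} km/s/Mpc  ->  1/H = {hubble_time_gyr(h):5.2f} Gyr")
```

With $H$ = 500 km/s/Mpc the Hubble time comes out at roughly 2 Gyr, which is the origin of the conflict with the geological age of the Earth; a value near 70 km/s/Mpc gives about 14 Gyr.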

It is worth noting that, like Eddington and Lundmark, Hubble refers only to 
redshifts in de Sitter space and not to the Friedman-Lemaitre expanding models. 
Other leading scientists remained unaware of the expanding universe interpretation, or, 
preferring a static picture, were initially unconvinced by it. Hubble himself is quoted 
by his biographer as saying in 1937 “Well, perhaps the nebulae are all receding in this 
peculiar manner. But the notion is rather startling” (Christianson 1995, p. 201) and 
even in his Darwin Lecture (Hubble 1953) talked of competing interpretations and said 
the law should be regarded as “an empirical relation between observed data”. 

† Up to overall scale. 

Figure 1. [Hubble’s original caption] 

Velocity-Distance Relation among Extra-Galactic Nebulae. 

Radial velocities, corrected for solar motion, are plotted against distances estimated 
from involved stars and mean luminosities of nebulae in a cluster. The black discs 
and full line represent the solution for solar motion using the nebulae individually; the 
circles and broken line represent the solution combining the nebulae into groups; the 
cross represents the mean velocity corresponding to the mean distance of 22 nebulae 
whose distances could not be estimated individually. ©US National Academy of 
Sciences. 

The reasons why others failed to pick up on Lemaitre’s discovery are considered 
by Luminet in his excellent editorial note to the recent reprint (Lemaitre 2013); only in 
1930, when Lemaitre sent a copy of his paper to Eddington, who then forwarded it to de 
Sitter and Shapley, did the expanding model come to the fore. Eddington (1930) also 
discovered the instability of the Einstein static metric to small changes of the parameters 
(whence his name is attached to the Eddington-Lemaitre model). This proof and his 
advocacy of Lemaitre’s work led to the wide acceptance of the evolving models, although 
in Eddington (1930) he noted that “it is possible that the recession of the spirals is not 
the expansion theoretically predicted; it might be some local peculiarity...”, a good 
point in view of the small sample then available. Einstein (1931) then renounced the 
cosmological constant and considered a Big Bang model with recollapse (O’Raifeartaigh 
& McCann 2014). 

In the English version of Lemaitre’s 1927 paper, published in 1931, the part on 
the magnitude-redshift relation is omitted. Research by Livio (2011) found that this 
was not, as has sometimes been claimed, a change made editorially, but was made by 
Lemaitre himself. Why that was done remains unclear. Luminet has discussed the other 
changes made in the 1931 version. 


3. Measuring the Hubble “constant” 


3.1. Measuring and interpreting redshifts 


Redshifts of galaxies can be rather accurately measured. This is done by identifying 
patterns of lines in spectra which are characteristic of emission or absorption by 
particular atoms or molecules, and comparing the observed frequencies with those 
measured in the lab. 

The method does of course assume that the emitted frequencies are not affected by 
the evolution of the Universe, and some explanations have abandoned that assumption. 
It has been examined in a number of works over the years, the most recent being 
centred on discussions of possible variations in the constants of nature, principally the 
dimensionless fine structure constant $\alpha$. Variation in both time (Murphy et al. 2003) 
and space (King et al. 2012) has been claimed, the latter showing a dipole which may 
be correlated with that of other indicators (Mariano & Perivolaropoulos 2013). Such 
a variation can be described as a variation of any of the dimensionful constants used 
to define $\alpha$, e.g. the velocity of light $c$. The claimed, and still controversial, observed 
variation is small enough that it does not significantly affect measurement of the Hubble 
constant. 

Treating light in the geometric optics approximation, one can show that if $k^a$ is the 
vector tangent to a light ray from an emitter G to an observer A, and objects have 
four-velocities $u^a$, then the red- (or blue-) shift observed by A is given by 

$$1 + z = \frac{(u^a k_a)_G}{(u^a k_a)_A}. \qquad (14)$$


If the $u^a$ are the velocities of timelike lines in the flat (Minkowski) space of special 
relativity, then for purely radial relative motion the redshift would imply a recession 
velocity $v$ given by 

$$1 + z = \sqrt{\frac{1 + v/c}{1 - v/c}}, \qquad (15)$$

the Doppler effect. For $v$ small compared with $c$, $z \approx v/c$. 
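The Doppler relation (15) is easily inverted for the velocity. The sketch below is illustrative only (the function names are hypothetical); it evaluates the relation in both directions and checks the small-velocity limit.

```python
import math

C_KM_S = 299_792.458  # speed of light in km/s


def redshift_from_velocity(v_km_s: float) -> float:
    """Radial special-relativistic Doppler shift: 1 + z = sqrt((1 + v/c) / (1 - v/c))."""
    beta = v_km_s / C_KM_S
    if not -1.0 < beta < 1.0:
        raise ValueError("|v| must be less than c")
    return math.sqrt((1.0 + beta) / (1.0 - beta)) - 1.0


def velocity_from_redshift(z: float) -> float:
    """Inverse relation: v/c = ((1 + z)^2 - 1) / ((1 + z)^2 + 1)."""
    if z <= -1.0:
        raise ValueError("z must exceed -1")
    s = (1.0 + z) ** 2
    return C_KM_S * (s - 1.0) / (s + 1.0)


print(velocity_from_redshift(1.0))    # v = 0.6 c, about 1.8e5 km/s
print(redshift_from_velocity(300.0))  # close to v/c = 0.001 for small v
```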

Since flat space is a good enough approximation to general relativistic geometry 
out to a distance given by the inverse square root of the magnitude of the Riemannian curvature 
components, using the velocities given by (15) was a good enough approximation for 
Hubble and remains a good one for considerably larger distances. (From (16) below, to 
a redshift $z > 1$ with current $\Omega_m$ and $\Omega_\Lambda$ values in (12).) 

However, within general relativity the expanding universe models are not flat. Since 
$k^a$ is geodesic, the change in the $(u^a k_a)$ of (14) along a ray is due to the difference 
between $(u^a)_G$ parallel-transported from G along the ray to A and $(u^a)_A$. This has to 
be computed using the equations for geodesics in the curved space and the assumed 
motions of the emitter and observer, which are usually taken to be those of comoving or 
“fundamental” observers in an FLRW model, i.e. those at constant spatial coordinates. 
For such observers this gives exactly the ratio of the spatial distances between them 
at the times of emission and reception, cf. (10); there has thus recently been some 
debate about whether the redshift should be thought of as due to a Doppler effect or 
not (Kaiser 2014). One aspect of the argument has been that the calculation does not 
involve the Riemann tensor, only the connection, and therefore should not be considered 
gravitational: however, the Riemann tensor and its derivatives completely determine 
local geometry, including the connection (see e.g. Theorem 9.1 in Stephani et al. (2003)). 

One should note that galaxies and other sources with measured redshifts will not be 
exactly following worldlines of fundamental observers in a best-fit FLRW model. They 
will have “peculiar motions” relative to that velocity. These tend to be small compared 
with the cosmic expansion velocities, e.g. they are not more than a few thousand km/s 
against the velocity of $\sim 2 \times 10^5$ km/s at redshift 1 implied by (15). While thus 
not critical in deriving the Hubble constant, measured peculiar motions are of great 
importance in identifying large scale gravitating structures such as the Great Attractor 
(Lynden-Bell et al. 1988). 

At the present time observed redshifts range up to 8.6 (Lehnert et al. 2010), and 
candidates in the range $z \sim 8.5$-$12$ have been identified (Ellis et al. 2013). 

Spectral lines in visible light are not the only way to measure redshift. One can 
also measure it from, for example, the 21 cm line of neutral hydrogen, common in both 
absorption and emission by intergalactic gas clouds. 

The origin of redshift in an expanding pseudo-Riemannian geometry is clear, but 
there have been a number of controversies about this FLRW interpretation. Apparent 
physical associations between objects of different redshifts led to a prolonged debate 
(Field et al. 1973). Other authors thought there was a “tired light” contribution, or 
banding of redshifts within clusters. 

3.2. Establishing the distance scale 

Redshifts are relatively easily and well measured. Measuring the true distances is much 
trickier: it is this which makes giving an accurate absolute value for H hard. 

The full story of how astronomical distances are obtained involves a great many 
astrophysical phenomena and techniques, and has generated a series of subjects of debate 
over several decades. By itself, it merited a very good book (Rowan-Robinson 1985); 
and there have been subsequent developments, probably including some of which I am 
unaware. So what follows is just a short summary. 

The principal idea, in measuring distance to other galaxies, is to measure the flux 
received from a “standard candle” for which an actual, intrinsic, luminosity L (the total 
output, or the output at given frequencies) is known, and then infer the distance by 
comparing the observed flux and the intrinsic luminosity. Standard candles are usually 
assumed to radiate isotropically. 

There are several obvious problems with this procedure: sources do not emit 
uniformly across different frequencies, so one wants to make the measurements in such 
a way that the intrinsic luminosity and observed flux refer to the same rest frequency; 
standard candles are not strictly standard and one has to allow for the intrinsic variations 
between members of a class, and possible resulting selection effects; and to increase the 
distance range covered one has to use intrinsically brighter standard candles, meaning 
one has to calibrate the intrinsic luminosity of the new class from members of it which 
are clearly at the same distance (e.g. because they are in the same galaxy) as members 
of known fainter classes. The series of increasingly bright classes form the “distance 
ladder”. 

For those new to this topic, the situation is complicated by astronomers’ adherence 
to magnitude and parsec, rather than SI, units. Magnitudes m arose in ancient 
astronomy. The Greeks classed the brightest stars as being of first magnitude, the next 
brightest as second magnitude, and so on: higher magnitude objects are thus fainter. 
Because the eye responds essentially logarithmically to received light, this means that 
the magnitude $m \propto -2.5 \log_{10} L$ for sources of intrinsic luminosities $L$ at a fixed distance: 
the factor 2.5 was proposed by Pogson in 1856 so that 5 magnitudes corresponds to a 
factor 100 in luminosity, agreeing to a good approximation with historic values for m. 
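Pogson's scale can be checked numerically. The following minimal sketch (function names are hypothetical) converts between magnitude differences and brightness ratios for sources at a common distance.

```python
import math


def flux_ratio(delta_m: float) -> float:
    """Brightness ratio (brighter / fainter) for a magnitude difference delta_m."""
    return 10.0 ** (0.4 * delta_m)


def magnitude_difference(ratio: float) -> float:
    """Magnitude difference m_faint - m_bright for a brightness ratio bright / faint."""
    if ratio <= 0:
        raise ValueError("ratio must be positive")
    return 2.5 * math.log10(ratio)


print(flux_ratio(5.0))  # five magnitudes: a factor of 100
print(flux_ratio(1.0))  # one magnitude: a factor of about 2.512
```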

To complicate matters, the zero of magnitude depends on wavelength, and on the 
stars or other objects taken as calibration standards (see e.g. Bessell (2005)). The two 
main systems now in use are the Vega (or Johnson) system and the AB system. The 
AB system is defined so that zero magnitude is 3631 Jansky in every band, where 1 
Jansky = $10^{-26}$ W/m$^2$/Hz. The Vega system, originally defined so the star Vega had 
zero magnitude in all bands, has been revised so that Vega is now magnitude 0.02-0.03. 
The two systems are close in the visual V band but, for example, in the near infra-red K 
band differ by 1.85 magnitudes. (As a historical note, a table produced by Hale Bradt 
in 1979 gave, for the wavebands U, B and V much used in the past for photometric 
data, the values 1896, 4267 and 3836 Jansky in those bands.) 
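As an illustration of the AB definition, a flux density in Jansky can be converted to an AB magnitude using the 3631 Jansky zero point quoted above. This is a sketch under that single assumption; the function names are hypothetical and no band-dependent corrections are attempted.

```python
import math

AB_ZERO_POINT_JY = 3631.0  # flux density of a zero-magnitude source in the AB system


def ab_magnitude(flux_density_jy: float) -> float:
    """AB magnitude corresponding to a flux density given in Jansky."""
    if flux_density_jy <= 0:
        raise ValueError("flux density must be positive")
    return -2.5 * math.log10(flux_density_jy / AB_ZERO_POINT_JY)


def jansky_to_si(flux_density_jy: float) -> float:
    """Convert Jansky to W / m^2 / Hz."""
    return flux_density_jy * 1e-26


print(ab_magnitude(3631.0))  # zero magnitude by definition
print(ab_magnitude(36.31))   # 100 times fainter: 5 magnitudes
```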

Apparent magnitude $m$ is given by the measured flux, in W m$^{-2}$, in the relevant 
waveband. Absolute magnitude $M$ is defined as the apparent magnitude the source 
would have if at a distance of 10 pc. For extended objects this has to be interpreted as 
the magnitude a point source of equal luminosity would have at that distance. Then, 
assuming an inverse square law for brightness, the “luminosity distance” $D_L$ is given in 
pc by 

$$M - m = -2.5 \log_{10} (D_L/10)^2 \;\Rightarrow\; D_L/(10\ \mathrm{pc}) = 10^{0.2(m - M)}.$$
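The relation between magnitudes and luminosity distance can be evaluated directly. The sketch below (hypothetical function names) ignores redshift, absorption and waveband corrections, in line with the inverse square law assumption just stated.

```python
import math


def luminosity_distance_pc(m: float, M: float) -> float:
    """D_L in parsecs from apparent magnitude m and absolute magnitude M."""
    return 10.0 * 10.0 ** (0.2 * (m - M))


def distance_modulus(d_l_pc: float) -> float:
    """m - M for a luminosity distance given in parsecs."""
    if d_l_pc <= 0:
        raise ValueError("distance must be positive")
    return 5.0 * math.log10(d_l_pc / 10.0)


print(luminosity_distance_pc(5.0, 0.0))  # m - M = 5 corresponds to 100 pc
print(distance_modulus(800e3))           # about 24.5 for a source at 800 kpc
```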

The definition of luminosity distance implicitly assumes that there is no redshift between 
G and A. Redshift factors affect distance measurements in two ways: they affect cross- 
sectional area measurements of beams of light by relatively moving observers, and 
produce spreading out of the spectrum. 

If one defines the observer area distance $r_A$§ in terms of the cross-sectional area $dS_G$ at 
G of a bundle of rays subtending a solid angle $d\Omega_A$ at A, by 

$$r_A^2 := dS_G/d\Omega_A,$$

then $D_L = (1 + z)^2 r_A$. Similarly one can define a distance $r_G$ from the solid angle $d\Omega_G$ 
subtended at G by an area $dS_A$ at A using $r_G^2 := dS_A/d\Omega_G$; then $D_L = (1 + z) r_G$, 
$r_G = (1 + z) r_A$. When dealing with fluxes in specific intensity ranges (per Hz), one also 
has to allow a $(1 + z)$ factor relating the widths of the wavebands. Past authors have 
not always been clear about what distance measure they meant, and thus the various 
factors of $(1 + z)$ were not always treated correctly. 
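Because the factors of (1 + z) are easy to misplace, it can help to keep the three distance measures together. A minimal sketch, assuming only the relations just stated (the function name and dictionary keys are hypothetical):

```python
def distance_measures(r_a: float, z: float) -> dict:
    """Return r_A, r_G and D_L (same units as r_a) for a source at redshift z."""
    if z <= -1.0:
        raise ValueError("z must exceed -1")
    r_g = (1.0 + z) * r_a  # r_G = (1 + z) r_A
    d_l = (1.0 + z) * r_g  # D_L = (1 + z) r_G = (1 + z)^2 r_A
    return {"r_A": r_a, "r_G": r_g, "D_L": d_l}


# At z = 1 the luminosity distance is four times the observer area distance.
print(distance_measures(100.0, 1.0))
```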

It should be noted that as a consequence of the relations between the distance 
measures, a check on the value of $D_L$ is in principle supplied by measuring $r_A$, assuming 
one knows $z$ and the intrinsic cross-sectional area of a class of objects. The resulting 
analogue of the $m$-$z$ relation is usually denoted the $\theta$-$z$ relation, where $\theta$ is the 
measured angular diameter of the objects. 

The first step in establishing the distance ladder is measurement of trigonometric 
parallax, the same technique humans and other animals with two eyes set apart use to 
infer distance. In astronomy one observes the (small) change in the direction in the sky 
of an object, usually a star, when the Earth is at opposite points of its orbit round the 
Sun (i.e. 6 months apart). The parallax is then one half of this, so it is $R_\oplus/D$ where 
$R_\oplus$ is the radius of the Earth’s orbit and $D$ is the distance. (For this purpose one can 
assume Euclidean geometry.) One parsec (pc) is defined as the distance at which an 
object in a direction perpendicular to the Earth’s orbit has a parallax of one arc-second. 
The known size of the Earth’s orbit ($R_\oplus$ is roughly $1.5 \times 10^8$ km) gives the SI units value 
of this (to 1 d.p.) as $3.1 \times 10^{13}$ km: one parsec is 3.26 light-years. Distances within the 
Galaxy lie in a range from about 1.3 pc (the distance to Proxima Centauri, the nearest 
other star to the Sun) to 20 kpc or so. M31 is at a distance of about 800 kpc. Distances 
to the most distant galaxies measured by standard candle type methods are of the order 
of Gpc. In modern galaxy surveys, Hubble’s law is inverted in order to infer (relative) 
distances from redshifts and so derive separations and maps of structures. 
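The numerical value of the parsec follows directly from its definition. The sketch below assumes Euclidean geometry and uses rounded values for the astronomical unit and the light-year; the names are hypothetical.

```python
import math

AU_KM = 1.495978707e8                    # radius of the Earth's orbit in km
ARCSEC_RAD = math.pi / (180.0 * 3600.0)  # one arc-second in radians
LIGHT_YEAR_KM = 9.4607e12                # one light-year in km (rounded)


def distance_from_parallax_km(parallax_arcsec: float) -> float:
    """Distance in km of an object with the given parallax, D = R / tan(p)."""
    if parallax_arcsec <= 0:
        raise ValueError("parallax must be positive")
    return AU_KM / math.tan(parallax_arcsec * ARCSEC_RAD)


PARSEC_KM = distance_from_parallax_km(1.0)


def distance_pc(parallax_arcsec: float) -> float:
    """Distance in parsecs; for small angles this is very nearly 1 / parallax."""
    return distance_from_parallax_km(parallax_arcsec) / PARSEC_KM


print(PARSEC_KM)                  # about 3.1e13 km
print(PARSEC_KM / LIGHT_YEAR_KM)  # about 3.26 light-years
print(distance_pc(0.5))           # a parallax of 0.5 arc-seconds: about 2 pc
```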

Classical observations using parallax could lead to errors as much as 10% at 30 pc. 
A major target was and is the Hyades open cluster, a group of stars used to calibrate 
methods used to make the next steps, such as using stellar spectroscopy to identify 
stars which have common properties and so can be standard candles, and the technique 
of main sequence matching, described briefly below. The Hyades is the open cluster 
nearest to us and is in the constellation Taurus: its stars have been and are studied in 
great detail. 

More recently, parallaxes have been measured using satellites such as Hipparcos 
and the Hubble Space Telescope (de Bruijne et al. 2001, McArthur et al. 2011): 
the GAIA satellite which commenced observations in 2014 is expected to enhance 
these considerably. At the distances accessible to direct parallax measurement, other 
methods which are used as confirmations or checks include cluster parallaxes (using 
apparent convergence of proper motions, i.e. tangential velocities as seen from Earth) 
and statistical parallaxes of some sample of stars. From these observations, the Hyades 
are now agreed to lie at about 47 pc. 

§ The notation here differs slightly from that of Ellis et al. (2012). Other authors denote $r_A$ by $D_A$ or 
$d_A$. 

Calibrating by Hyades stars, one can use stellar spectroscopy for individual stars, 
and the position in the luminosity-temperature diagram (the Hertzsprung-Russell 
diagram) of the zero-age main sequence, the curve which newly formed stars settle 
to as they enter the phase in which their energy mainly comes from fusing hydrogen to 
helium. 

A next step in the ladder comes from the period-luminosity or period-luminosity- 
colour relations for variable stars (the first of these, for Cepheids, had been discovered 
by Henrietta Leavitt in 1908). RR Lyrae stars, W Virginis stars and Cepheids have been 
used, with various difficulties of calibration. These indicators, together with novae and 
supernovae, can be regarded as primary indicators, enabling calibration of secondary 
indicators which can be used in parallel to observe to even greater distances (Rowan- 
Robinson 1985). 

The supernovae considered as primary indicators are of two types (these do not 
cover all supernovae). For both types, decay of unstable radioactive isotopes, notably 
$^{56}$Ni, formed by the explosion is an important, and for SNIa the only, source of the 
light. Type II supernovae, those with hydrogen lines in the spectrum, are due to 
core collapse of young massive stars, and are seen in spiral galaxies. Theoretical models 
of surface brightness (in the simplest case, assuming a black-body spectrum) and the 
observed luminosity can be used to derive an angular size and this can then be combined 
with Doppler redshift measurements to obtain the velocity of expansion and thus the 
linear size and distance: this is called the Baade-Wesselink method. 

Type I supernovae have great homogeneity in the time variation of their luminosity 
and colour. They are believed to be due to a white dwarf exploding because accretion 
had raised its mass above the Chandrasekhar mass limit: evidence for this has come 
from observations of Nugent et al. (2011), but there is still uncertainty about the origin 
of the accreted mass. Type Ia (the ones with a strong ionised silicon absorption line) are 
the brightest: see section 5 for the resulting $m$-$z$ data. Over 100 were known by the 
1980s and new ones are now being discovered at a rate greater than 2000 per annum. 

Secondary indicators include ionised hydrogen (HII) regions, the brightest globular 
clusters in a galaxy, the brightest stars in a galaxy, the Tully-Fisher relation between the 
luminosity of spiral galaxies and the width of the 21 cm emission of neutral hydrogen 
atoms in those galaxies, and finally overall properties of galaxies (colour-luminosity 
relations, luminosity classes, sizes, brightest cluster galaxies). 

Hubble based his distance measurements on observations of Cepheid variable stars, 
calibrated by observations in the Galaxy. Unfortunately, as I now discuss, the variable 
stars he was observing in M31 and other galaxies were not Cepheids, but the similar but 
less bright type of variable, the W Virginis stars. He then used these measurements to 
calibrate the “brightest star” indicator, and so could fold galactic distances with both 
3.3. Measured magnitude-redshift relations 

The first major revision of Hubble’s results was brought about by the commissioning 
of the Mount Palomar 200 inch telescope in 1950. Baade used it to observe M31. Had 
Hubble’s distance been correct, Baade should have been able to see RR Lyrae stars, and 
he could not. The identification of two stellar populations‖ arose in this work. The two 
populations are distinguished by their metallicities¶. 

Baade realised that the two rather similar types of variable, W Virginis stars and 
Cepheids, in the two populations had been conflated by Hubble. W Virginis stars 
(or type II Cepheids) are Population II stars with brightest magnitudes around $-3.5$ 
whereas (type I) Cepheids are Population I stars with brightest magnitudes about $-5$. 
Disentangling the two led to revision of Hubble’s distance scale by a factor of about 2.5. 

Because the principal uncertainty in magnitude redshift relations lies in the 
distances, cosmologists often use a parameter $h = H/100$, where $H$ is in km/s/Mpc. The 
Hubble age, $1/H$, is then approximately $10/h$ Gyr. Baade’s $h$ was about 2. The next 
big revision arose from Sandage’s 1958 discovery that HII (ionized hydrogen) regions 
had been misidentified as stars: he noticed that they looked too red to be stars, as a 
result of the Balmer lines registering on red plates. Sandage gave a value h = 0.75; 
by 1975 he was using galaxies out to redshifts above 0.3, i.e. ten times Hubble’s most 
distant objects. 
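The conversion between $h$ and the Hubble age is a matter of units. The following sketch (hypothetical names, rounded conversion constants) evaluates $1/H$ for the values of $h$ mentioned in the text: Hubble's 500 km/s/Mpc, Baade's $h$ of about 2, and Sandage's 0.75.

```python
KM_PER_MPC = 3.0857e19      # kilometres in one megaparsec
SECONDS_PER_GYR = 3.156e16  # seconds in 10^9 years (rounded)


def hubble_age_gyr(h: float) -> float:
    """Hubble age 1/H in Gyr for H = 100 h km/s/Mpc."""
    if h <= 0:
        raise ValueError("h must be positive")
    hubble_per_second = 100.0 * h / KM_PER_MPC  # H expressed in 1/s
    return 1.0 / hubble_per_second / SECONDS_PER_GYR


for label, h in (("Hubble", 5.0), ("Baade", 2.0), ("Sandage", 0.75)):
    print(f"{label:8s} h = {h:4.2f} -> 1/H = {hubble_age_gyr(h):5.2f} Gyr")
```

The result for $h = 1$ is about 9.8 Gyr, which is the origin of the approximate rule $10/h$ Gyr.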

Baade and Sandage’s discoveries not only ushered in a new phase in cosmological 
science, as described in Section 4.2, they also provided the final step in establishing 
the “Copernican Principle” that humanity has no special position in the Universe. 
Copernicus and Galileo had argued that the Earth was not the centre of the Solar 
System, and Shapley, Lindblad and Oort had proved the Sun was not at the centre of 
the Galaxy. The revised distance scales now showed the Galaxy was not the uniquely 
largest in the Universe. (The recent discovery of many planets orbiting stars other than 
the Sun, some possibly habitable, provides further evidence against humanity having 
a special position in the physical Universe: as yet the existence of intelligent life on 
extrasolar planets remains conjecture rather than confirmed fact.) 

There was considerable debate about the correct value for H after 1958, in which 
values between about 54 and 100 were obtained by various sets of new observations and 
analyses. (Geoff Burbidge used to show a log-log graph of the estimated age of the 
universe against the date on which the estimate was made, which was linear starting 
from Bishop Ussher’s estimate of 6000 years. He thus claimed that if it was known when 
the next revision would occur one could predict its value.) Nevertheless, the current 
best estimates essentially agree with Sandage’s 1958 figure. 

It has been suggested that there are variations of $H$ with direction, in particular 
a dipole variation due to the motion of the Galaxy with respect to other galaxies. 
One difficulty in studying this lies in relative distance scale calibrations for different 
directions. 

‖ Populations I and II, to which are nowadays added the very low metallicity Population III stars. 

¶ In astronomy, “metallicity” is the fraction of stellar matter not in the form of hydrogen or helium. 

The value of $H$ appears in a linear approximation for the $m$-$z$ relation at small $z$. 
Such a relation applies for any congruence of worldlines of galaxies and observers with 
four-velocities $u^a$ in a relativistic spacetime, and in this case $H = u^a{}_{;a}/3$. To study the 
relation at greater $z$ one needs a more accurate model. Observationally, it is natural to 
approximate the relation as a series: the coefficient of the first nonlinear correction then 
gives the value of $q$. The relevant expansion for sources of equal intrinsic luminosity in 
an FLRW model reads 

$$m = 5 \log_{10} z + 1.086(1 - q)z + O(z^2) + \mathrm{constant}. \qquad (16)$$

One can extend this series, and calculate series for other measurements, not only for 
FLRW models but also for models with anisotropy and inhomogeneity (Kristian & 
Sachs 1966, MacCallum & Ellis 1970), but such approximations become poor ones at 
the large redshifts now observed. It is more usual now to compare data with numerically 
computed relations. 
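To see how weakly $q$ enters at small redshift, the truncated series (16) can be evaluated for two trial values of $q$. This is a sketch only: the function name is hypothetical, the additive constant is left as a free parameter, the $O(z^2)$ terms are dropped, and the two values of $q$ are arbitrary illustrative choices. The coefficient 1.086 is $2.5/\ln 10$.

```python
import math


def apparent_magnitude_series(z: float, q: float, constant: float = 0.0) -> float:
    """Leading terms of (16): m = 5 log10(z) + 1.086 (1 - q) z + constant."""
    if z <= 0:
        raise ValueError("z must be positive")
    return 5.0 * math.log10(z) + 1.086 * (1.0 - q) * z + constant


# Magnitude difference between two assumed values of q at several redshifts.
for z in (0.01, 0.1, 0.3):
    dm = apparent_magnitude_series(z, q=-0.5) - apparent_magnitude_series(z, q=0.5)
    print(f"z = {z:4.2f}: m(q=-0.5) - m(q=0.5) = {dm:.4f} mag")
```

In this truncation the difference is simply $1.086\,\Delta q\,z$, so at $z = 0.01$ a change of 1 in $q$ shifts $m$ by only about 0.01 magnitudes, which indicates why accurate photometry at larger redshift is needed.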

The numerous attempts up to the 1990s to obtain observations sufficiently accurate 
to obtain q from (16) (as suggested by Hubble (1938)) ran into various experimental 
and theoretical problems. These included the K-correction (relating fluxes from different 
emitted frequencies), aperture correction (to make sure the flux is solely and wholly from 
the intended object), absorption in the Galaxy and the intergalactic medium, the effects 
of lumpiness arising because the matter density within the observed beams of light differs 
from the average density (Dyer & Roeder 1974), selection effects (which arise, e.g., from 
favouring the brightest in a standard candle class) and the unknown evolution of the 
sources. Estimates varied from about 0.25 to 1.6. 

The most prominent recent method for measuring the magnitude redshift relation 
has used supernovae of type Ia. As described in section 5 below, this led to the first 
good measurement of $q$, i.e. of the time variation of $H$, giving a value which has since 
been shown to be consistent with data from baryon acoustic oscillations and gamma ray 
burst sources. 

Yet more ways of obtaining and extending m — z may become available soon. For 
example, the use of reverberation mapping of AGNs and quasars, which exploits the time 
delay between variations in the continuum and line emission regions round AGNs and 
quasars and the theory of accretion disks, could obtain sizes and hence distances, and 
gamma-ray burst sources appear to extend the SNIa relation to much higher redshifts. 

It is perhaps disappointing that the correct value of H is still rather uncertain. 
Calibration of the SNIa data, still using Hubble’s classic method of observing Cepheid 
variables, leads to a value 73.8 ± 2.4 (Riess et al. 2011), while the Planck data (Ade 
et al. 2014) gives 67.3 ± 1.2. This illustrates how hard it is to obtain precise cosmological 
distances even now, 85 years after Hubble’s work. 
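One rough way to quantify the disagreement between the two values is to express their difference in units of the combined uncertainty. The sketch below assumes, purely for illustration, that the quoted errors are independent and Gaussian; the function name is hypothetical.

```python
import math


def tension_sigma(value_a: float, err_a: float, value_b: float, err_b: float) -> float:
    """Difference of two measurements in units of their combined standard error."""
    return abs(value_a - value_b) / math.hypot(err_a, err_b)


# Values in km/s/Mpc as quoted above.
print(tension_sigma(73.8, 2.4, 67.3, 1.2))  # roughly 2.4
```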
Figure 2. Scale factor $a(t)$ for (a) $\Lambda = 0$, (b) $\Lambda < 0$, (c) $\Lambda > 0$. From Figure 9.1 of 
Ellis et al. (2012), itself adapted from Fig. 4 of Ellis (1971). 


4. Modeling the expanding universe, after 1929 

4.1. Modeling Hubble’s results 

There were many papers after 1929 exploring expanding universes, though initially most 
avoided the singular ones, i.e. the Big Bang cases. It was again Lemaitre who took the 
lead in realizing the significance of the singular models (Lemaitre 1931). However, 
once the concept of expansion had become generally accepted it was clear that FLRW 
models agreeing with Hubble’s data almost all expanded from a Big Bang. Only the 
Eddington-Lemaitre models asymptotic to the Einstein static model as $t \to -\infty$ avoided 
a singularity, and they were unstable. There are models that “coast” close to the 
Einstein static model for a period: these were invoked at one time to explain an apparent 
clustering of quasar redshifts around 1.95 but no such effect is now believed to exist. 

Thus by the time of Robertson’s review (1933), the behaviour of all the FLRW 
models was pretty well understood. In summary, the generic behaviours possible are 
illustrated by Fig. 2, assuming $\mu$ and $p$ are positive in the case $\Lambda = 0$. 

At small $a$ the $\Lambda$ and $K$ terms in (6) are negligible. Then for $w = 0$ (“dust”) we 
have $a \propto t^{2/3}$, and for $w = 1/3$ (“radiation”), $a \propto \sqrt{t}$; the corresponding exact solutions 
are due to Einstein & de Sitter (1932) and Tolman (1934). For constant $w$ (other than 
$-1$) one obtains $a \propto t^{2/(3(1+w))}$. At large $a$, $\Lambda$, if non-zero, will give the dominant term: 
if $\Lambda = 0$, then, assuming that pressure asymptotically vanishes as $a \to \infty$, the value of 
$K$ determines the far future behaviour (re-contraction if $K = 1$, expansion for ever if 
$K = -1$, approaching the Milne solution $a \propto t$ which is flat Minkowski space in FLRW 
coordinates, and convergence to the Einstein-de Sitter model if $K = 0$). 
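The power-law exponents quoted for small $a$ can be tabulated exactly. The sketch below is illustrative (hypothetical function names) and applies only to the idealised case of constant $w \neq -1$ with the $\Lambda$ and $K$ terms neglected throughout.

```python
from fractions import Fraction


def expansion_exponent(w: Fraction) -> Fraction:
    """Exponent n in a ∝ t^n for constant w != -1: n = 2 / (3 (1 + w))."""
    if w == -1:
        raise ValueError("w = -1 does not give a power law")
    return Fraction(2) / (3 * (1 + w))


def age_in_hubble_times(w: Fraction) -> Fraction:
    """For a ∝ t^n, H = (da/dt)/a = n/t, so the age is t = n * (1/H)."""
    return expansion_exponent(w)


print(expansion_exponent(Fraction(0)))     # dust: 2/3
print(expansion_exponent(Fraction(1, 3)))  # radiation: 1/2
```

For dust the age in such a model is two thirds of the Hubble age $1/H$, so with these assumptions the Hubble age overestimates the true age.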

As noted above, the radiation contribution is negligible for most of the life of the Universe (for $t > t_{eq}$), 
although it is the dominant term between the end of inflation and $t_{eq}$. On the other 
hand, $\Omega_\Lambda/\Omega_m \propto a^3$ implies that a nonzero $\Lambda$, or more generally dark energy in any 
of its proposed forms, becomes dominant at late times in an FLRW expansion. Since 
$\Omega_K/\Omega_m \propto a$, $K$ is dynamically unimportant at small $a$, and for realistic FLRW models 
only becomes dynamically important at large $a$ if the dark energy contribution decreases 
faster than $\Omega_K$. Nonzero $K$ is the generic case, but the recent evidence for nonzero $\Lambda$ 
has meant that the effort expended on determining $K$ has lost part of its motivation, 
since a value $K \neq 0$ may not be critical in determining the future behaviour of the 
expansion. That $\Omega_K/(\Omega_m + \Omega_\Lambda)$ is small is consistent with inflation, as discussed above. 

Taken together, these remarks, together with the considerations on thermal history 
(see below and Durrer (2015)), show why it is usual to adopt a Tolman model in the 
early universe and a dust model at late times (moderated, nowadays, by the inclusion 
of A in recent epochs). 

The main reason why expanding models were not universally accepted immediately 
after 1929 was that if (13) were universal and $H = 500$ had remained constant as the 
Universe evolved, interpretation as an expanding universe implied a “Hubble age” $1/H$ 
for the Universe of 2 Gyr. With $\Lambda = 0$ and positive $\mu$ and $p$ the Hubble age is an 
upper bound for the age of an FLRW model. Hubble’s value for 1/H was less than the 
age of the Earth, which was known to be about 4.5 Gyr. That discrepancy led to the 
consideration of various alternative theories of gravity and cosmology, although most 
of the alternatives also considered the universe to be expanding: for fuller accounts of 
them see North (1965) and Bondi (1960). 

Among the alternatives the most widely considered (at least in the UK) was the 
Steady State theory first developed by Bondi & Gold (1948) and Hoyle (1948). This 
used (9) as the metric, so the universe expands in this theory: it manages to remain the 
same at all times due to continuous creation of new matter. Steady State theory had 
nice simplifying features and produced definite predictions, whereas Big Bang theory 
had some unknown parameters: accordingly Steady State attracted a strong body of 
proponents. 

Age problems involve not only the Earth’s age. Until the 1920s, the only known 
source of the required energy for a star was the gravitational potential energy of the 
star, but this “contraction hypothesis” led to a maximum age around $2 \times 10^7$ years, 
in conflict with “biological, geological, physical and astronomical arguments” (to quote 
Eddington). From about 1920 onwards (see Eddington (1926)) it was recognized that 
the power source was nuclear fusion, for main sequence stars the fusion of hydrogen to 
helium (another insight arising from relativity, in this case Einstein’s deduction within 
special relativity of the famous $E = mc^2$). Estimates of stellar ages are subject to 
significant uncertainties, due to the complexities of modelling stellar evolution. The 
greatest currently estimated age of an observed star is 14.5 ± 0.8 Gyr (Bond et al. 2013) 
which is very close to current estimates of the time back to the Big Bang. 

Baade’s distances led to a Hubble age of 5 Gyr, just about compatible with the age 
of the Earth. The age of the Galaxy was then estimated to be between 3 and 15 Gyr, 
on the basis of data from meteorites, stellar evolution and dynamics, and observations 
of interstellar dust (Bondi 1960), so this too could just about be accommodated. These 
age estimates became comfortable after Sandage’s revision, which led to a Hubble age 
of about 13 Gyr and removed that motivation for alternative theories. 

Despite the age problem, FLRW models continued to be investigated between 1929 
and 1952, if perhaps more slowly than they might have been, and some aspects which 
have since assumed ever-increasing significance were first explored. 

The idea of remnant radiation from the Big Bang arose in Lemaitre (1931), 
but the resultant black-body spectrum and decoupling were not discussed then and 
Lemaitre thought cosmic rays might be the remnant. Our current picture is of course 
that baryonic matter and radiation are strongly coupled in the early universe, that 
the relevant temperatures therefore evolve together, and that the coupling ends at a 
temperature around 3000 K (more accurately, at about $z = 1089$) although the radiation 
remains thermal (black-body). After decoupling, the radiation propagates freely (or, 
possibly, interacts with a reheated intergalactic medium). The decoupling happens 
rather abruptly and thus can be characterized as happening at a specific surface $t = t_{dec}$, 
the “last scattering” surface. 

Tolman considered the thermodynamics of Big Bang models, including the effect 
of expansion on black-body radiation temperature and of a gas in equilibrium with it 
(Tolman 1934). However, he did not infer the presence of the CMB. That was predicted 
later, when the lack of equilibrium, which is forced by expansion (Stewart et al. 1970), 
was brought into play to discuss element formation, in work which also led to the theory 
of primordial nucleosynthesis. 

Such nucleosynthesis was first considered by Gamow, in a series of papers from 1942 
onwards. He recognized the need for high temperatures to enable neutron capture, and 
the possibility of this for a limited period in the expansion of radiation-dominated models 
(Gamow 1946, Gamow 1948). Alpher, Herman and colleagues developed this further, 
and carried out detailed nucleosynthesis calculations. They also predicted the CMB at 
a temperature of 5 K (Alpher & Herman 1949). The outcome of these calculations was 
that primordial nucleosynthesis could produce the light elements but not the heavy ones 
(Alpher et al. 1953, Gamow 1956). 

The mechanism to form light elements starts with neutron-proton merger to form 
deuterium followed rapidly by a series of reactions to form He^4. The deuterium is 
instantly dissociated by incident photons if T > 8 × 10^8 K. Neutrons decay to protons 
with a half-life of about 660 s and after this has happened no more deuterium forms. So 
the outcome depends on the neutron-proton ratio when T = 10^8 K, which gives the 
initial condition for deuterium formation, and the time scale of expansion, which causes 
collision rates to drop as density does. It was Hayashi (1950) who pointed out that the 
initial ratio arose from processes with high T dependence which would give a thermal 
equilibrium until their time scales became slow compared with expansion. Taking that 
into account the helium to hydrogen ratios came into good agreement with observation. 
Wagoner et al. (1967) refined the calculations and included elements up to Li^7. 
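The dependence of the helium abundance on the neutron-proton ratio can be illustrated with a back-of-envelope estimate. This is only a sketch: it assumes that essentially every neutron surviving to deuterium formation ends up in He-4, and the ratio n/p = 1/7 used in the example is a hypothetical input, not a value taken from the text.

```python
# Illustrative estimate of the primordial helium mass fraction.
# ASSUMPTIONS (not from the text): every surviving neutron is bound into
# He-4, and the neutron-to-proton ratio is supplied by hand.


def neutron_fraction_from_ratio(n_over_p):
    """Convert a ratio n/p into the fraction n/(n+p)."""
    return n_over_p / (1.0 + n_over_p)


def surviving_neutron_fraction(initial_fraction, elapsed_s, half_life_s=660.0):
    """Neutron fraction n/(n+p) left after free decay for elapsed_s seconds.

    The default half-life is the value quoted in the text.
    """
    return initial_fraction * 0.5 ** (elapsed_s / half_life_s)


def helium_mass_fraction(neutron_fraction):
    """Mass fraction in He-4 if each neutron pairs with a proton into He-4."""
    return 2.0 * neutron_fraction


if __name__ == "__main__":
    x_n = neutron_fraction_from_ratio(1.0 / 7.0)  # hypothetical n/p = 1/7
    print(helium_mass_fraction(x_n))
```

A ratio of one neutron to seven protons corresponds to a helium mass fraction of one quarter under these assumptions; the decay helper shows how a longer wait before deuterium formation lowers the yield.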

The theory of production of heavy elements in stars was developed by Burbidge 
et al. (1957) in the context of the Steady State theory. This work showed such synthesis 
could not produce the observed deuterium. Stellar nucleosynthesis complements the 
primordial element production in giving the observed species. 

Another important topic opened up in this period was the theory of perturbations 
of expanding models. Newtonian models were considered by various authors, notably 
Bonnor (1956): Lifshitz (1946) was the first to obtain a growth law for perturbations in 
relativistic models and introduced methods followed in later work. However, until 1980 
there were no plausible ways to generate the necessary perturbations. For example, 
in giving a summary of what was then known, Harrison (1973) noted that if thermal 
fluctuations provided the seeds for galaxies, they had to be generated between 10^-31 
and 10^-29 s after the Big Bang. 

Yet another topic introduced in this era was the study of inhomogeneous and 
anisotropic models (see chapters 17-19 of Ellis et al. (2012) for a recent review of 
these). Lemaitre (1933) introduced the spherically symmetric dust models now called 
the Lemaitre-Tolman-Bondi (LTB) models, in order to model collapse of overdense 
regions into galaxies. In the same paper he considered, after a suggestion from Einstein, 
the spatially homogeneous anisotropic Bianchi I models, in order to illustrate that the 
singularity at the Big Bang of the FLRW models was not unique to that case. 

4.2. Modeling with improved distance scales 

In the 1960s two observational developments in particular, the counts of radio sources 
and the detection of the CMB, gave the Big Bang view the edge over Steady State 
(although Steady State enthusiasts continued for some time to try to incorporate these 
observations within their theory). 

The cosmic radio source count data showed that the universe was not only 
expanding but evolving. It did so by plotting source count numbers N against radio 
flux S, so proving that the number of radio sources in the past was higher than today 
by more than could be accounted for by geometric effects in any reasonable model. The 
data from which this inference was initially drawn was inadequate, due to confusion 
between sources etc. but by 1968 these systematic errors had been corrected (Pooley 
& Ryle 1968); see Figure 3. Therefore the population of radio sources, and thus the 
Universe itself, had to be evolving. Unfortunately, the intrinsic variations between radio 
sources and the lack of detailed understanding of their evolution make it hard to gain 
other cosmological information from them. 
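For reference, the static Euclidean baseline against which such counts are compared follows from a short argument. As a sketch, assume sources of a single luminosity L (an idealization introduced here) with uniform number density n; the number per steradian brighter than flux S is then:

```latex
S = \frac{L}{4\pi r^{2}}
\quad\Longrightarrow\quad
r(S) = \left(\frac{L}{4\pi S}\right)^{1/2},
\qquad
N(>S) = \frac{n}{3}\, r(S)^{3}
      = \frac{n}{3}\left(\frac{L}{4\pi}\right)^{3/2} S^{-3/2}.
```

Summing over a distribution of luminosities preserves the S^{-3/2} form, so a count that rises more steeply than this towards faint fluxes indicates an excess of faint, and hence on average more distant, sources relative to the baseline.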

Figure 3. The N-S curve from Pooley & Ryle (1968), Fig. 2, and its caption: 
“Fig. 2. The results expressed as a plot of log (N/N_0) against log S, where N_0 is the 
number of sources expected in a static Euclidean universe. The lower curve corresponds 
to a source-conserving Einstein-de Sitter model. The curves have been normalized to 
N_0 = 6.2 per steradian at S_408 = 15 × 10^-26 W m^-2 Hz^-1. The dashed line is an 
extrapolation of the form N ∝ S^-0.85.” 

The earliest paper to re-examine the CMB prediction was by Doroshkevich 
& Novikov (1964) but, as described by Novikov (2009), its significance remained 
unappreciated, as did data by Shmaonov and the observations of the CN molecular lines. 
For discussion of the discovery by Penzias and Wilson and subsequent observational and 
theoretical work see Durrer (2015). The presence of the CMB confirmed the evidence 
from element abundances, showing that the universe had been through a hot dense 
phase, and had thus come from a Big Bang. 

The generally accepted thermal history for FLRW models now started with a 
Tolman radiation universe with known ratio a/\ft (see Durrer (2015)), followed by 
a “matter era” driven by dust but with unknown 

Isotropy about our position in space was shown at that time by a number of tests. 
Galaxy number counts were only known to be isotropic to about 30% (the uncertainty 
being due to the clustering), but the radio source counts and X-ray background were 
isotropic to better than 5% and the CMB was known from about 1969 onwards to be 
isotropic to better than 0.2%. The first anisotropy measured in the CMB, in 1969, was 
the dipole due to the motion of the Sun relative to the frame in which the CMB is most 
nearly isotropic (see Durrer (2015) for details and references). 

In contrast, then and now, obtaining clear evidence for spatial homogeneity is 
difficult in that we do not receive data from objects separated from us in space but not 
in time. In particular we cannot in principle separate space and time evolution, since, 
apart from “geological evidence” in our local neighbourhood, our observations are on 
our past null cone. So to infer homogeneity implies assumptions about how the matter 
content behaves away from the null cone. It is not even easy to test the assumption that 
there are spacelike surfaces of approximate homogeneity extending to large distances. 
For example the surface of last scattering might be a timelike cylinder intersecting our 
past null cone in the sphere we actually observe (Ellis et al. 1978) (though the particular 
model where this was considered is ruled out by other considerations). Even when one 
takes the more usual assumption that last scattering occurs on a spacelike surface, one 
needs to be able to apply some theory of the evolution objects undergo between the 
emission of the received radiation and now. These assumptions are hard to test. 

Thus to improve tests of homogeneity we would need to have a much better 
understanding of evolution of galaxies, radio sources and other objects than we do. We 
are however able to make two types of tests: we can check that there is no local evidence 
for inhomogeneity, for as far back in time as we feel confident about our understanding 
of evolution, and we can test for the presence of significant mass variations which are 
asymmetrically arranged about us by looking for their effects on the CMB. The tightest 
bounds of the latter type seem to be those of Zibin & Moss (2014). 

Thus in the 1960s and 1970s it was becoming ever clearer that the FLRW models 
satisfied the observational requirements of expansion, evolution, a hot dense phase, and 
apparent isotropy and spatial homogeneity. They accounted for the evolution of the 
light elements, while nucleosynthesis in stars accounted for the heavy ones, and they 
naturally gave rise to the CMB. The value of H was believed to be at most 100, and 
therefore there was no serious age problem. 

The models still had significant unknown or poorly known parameters: Ω_m, K, q 
and Λ. They also did not predict the baryon number B, the ratio of the number of 
photons per unit volume (in the CMB) to the corresponding count of baryons, which 
is a measure of entropy and affects the helium/hydrogen ratio and other light element 
fractions, and so on. There were attempts to obtain the FLRW model parameters not 
only from the m-z relation but also the θ-z and N-S relations described above, the 
N-m relation for galaxies (a test proposed by Hubble (1936)), and the <V/V_max> 
test, where V is the volume of a sphere of radius given by the distance of a source and 
V_max the maximum volume within which it would be detectable: precise values remained 
elusive. 
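The logic of the <V/V_max> test can be illustrated with a toy Monte Carlo: for a non-evolving population spread uniformly through static Euclidean space, the detected sources should give a mean V/V_max close to 0.5. The sketch below is a simplification under stated assumptions (Euclidean geometry, a uniform luminosity range chosen arbitrarily, a sharp flux limit); all names and numbers are illustrative.

```python
import math
import random


def v_over_vmax_ratios(n_sources, flux_limit, survey_radius, seed=0):
    """Return V/V_max for the detected members of a uniform toy population.

    ASSUMPTIONS: static Euclidean space, luminosities drawn uniformly from
    an arbitrary range, and a sharp detection threshold flux_limit.
    """
    rng = random.Random(seed)
    ratios = []
    for _ in range(n_sources):
        # Distance drawn uniformly in volume inside the survey sphere.
        r = survey_radius * rng.random() ** (1.0 / 3.0)
        luminosity = rng.uniform(0.5, 2.0)
        # Largest distance at which this source would still exceed the limit.
        r_limit = math.sqrt(luminosity / (4.0 * math.pi * flux_limit))
        r_max = min(r_limit, survey_radius)
        if r <= r_max:
            ratios.append((r / r_max) ** 3)
    return ratios


def mean(values):
    return sum(values) / len(values)


if __name__ == "__main__":
    sample = v_over_vmax_ratios(50000, flux_limit=0.08, survey_radius=2.0)
    print(len(sample), mean(sample))
```

A mean significantly above 0.5 in real data would point to sources being more numerous or more luminous at large distances, which is the kind of evolution discussed for the radio source counts.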

It became clear that there was dark matter in the universe (see Longair & Rees 
(1973) or Coles & Ellis (1997)). The phrase “missing matter” came into use, but 
rather confusingly was used both for the mass inferred to be in galaxies and clusters in 
addition to their visible mass (which pointed to an Ω_m of about 0.3: see the references 
immediately above) and by those who believed that the true Ω_m was 1, although there 
was no observational evidence for such a large value (or for a non-zero Λ). 

The period saw some important steps in understanding light propagation in the 
Universe. Sachs & Wolfe (1967) worked out the effects on the CMB of gravitational 
redshifts: the fully non-linear integrated effects were discussed by Rees & Sciama (1968). 
Sunyaev & Zel’dovich (1970) described the effect of inverse Compton scattering in 
galaxies on CMB observation. See Durrer (2015). Several papers, notably those of 
Dyer & Roeder, considered light propagation in lumpy models. 

One aspect of FLRW models studied in greater detail was the impact of the exact 
rate of expansion during element formation (which depends on the particles present 
and on the anisotropy) on the resultant ratio of atomic species. Steigman et al. 
(1977) showed that such cosmological considerations limited the number of neutrino 
species, at that time unknown but expected from the quantum theory of the standard 
model of particle physics to be 3, to at most 5. A more recent re-analysis (Cyburt 
et al. 2005) gave 2.67 < N_ν < 3.85 at the 68% confidence level, while Planck’s data 
gave N_eff = 3.30 ± 0.27 (Ade et al. 2014). Similarly, Barrow (1976) used the effect of 
anisotropy on the expansion rate to give a limit 10^-7 on present-day anisotropic shear 
σ, possible because a σ^2 term has to be added to the Friedman equation (6). 

Big Bang FLRW models have a singular origin. The understanding of singularities, 
defined as the presence of geodesics which could not be indefinitely continued, developed 
considerably during the 1960s and 70s. Theorems proving their existence in relativity 
(see Hawking & Ellis (1973), Tipler et al. (1980) and the Milestone “The singularity 
theorems (1965)”) were complemented by examples of possible behaviours and by the 
work reviewed in Belinskii et al. (1982) aimed at describing singularities’ generic form. 
As a result of the CMB discovery, it was possible to prove that a relativistic cosmology 
that was expanding approximately like an FLRW model had to have had a trapped 
surface in the past and therefore must have had a singularity in the past (Ellis & 
Hawking 1968). 

5. Modern relativistic cosmology 

5.1. Observations 

Three important sources of observational data, the anisotropies in the CMB, the m-z 
relation itself, and galaxy surveys, have undergone major developments since the 1980s. 
Combining the results has given a much stronger base for our modeling. Moreover, other 
sources of data have been and are being added. 

The first to improve was the measurement of the fluctuations in the CMB (beyond 
the level of the dipole due to the Earth’s motion), discussed more fully by Durrer (2015). 
This has evolved from the first indications in the COBE data of the early 1990s through 
to the very detailed recent results by Planck (e.g. Ade et al. (2014)) and a number of 
ground-based telescopes. There are ongoing observations by the ever-increasing number 
of ground-based instruments which will give us further fine detail. 

Drawing the implications from this data depends on calculating the evolution of 
fluctuations from the input fluctuation spectrum predicted to have been formed during 
inflation, within an FLRW model (see section 5.2 and Durrer (2015)). Comparing such 
calculations with the observations puts constraints on the expansion parameters H, Ω_m, 
Ω_k and Ω_Λ. Note that these parameters give rise to accumulated effects over a long 
period, rather than referring to measurements of relatively nearby objects. 

The most unexpected addition to previous data came from the Hubble relation 
for supernovae of type Ia. It was unexpected because if Λ = 0, as was generally 
believed, then (11) shows that the expansion rate should be decelerating. The result, 
first announced in 1998, showed instead that q < 0, implying Λ > 0: it won the lead 
scientists the 2011 Nobel prize. 
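Equation (11) is not reproduced in this section. Assuming it is the usual relation for an FLRW model containing pressure-free matter and a cosmological constant (radiation neglected), the sign argument can be made explicit; the numerical values are the Planck-based ones quoted in Section 5.2:

```latex
q \equiv -\frac{\ddot a\, a}{\dot a^{2}} = \tfrac{1}{2}\,\Omega_m - \Omega_\Lambda ,
\qquad
\Lambda = 0 \;\Rightarrow\; q = \tfrac{1}{2}\,\Omega_m \ge 0 ,
\qquad
q_0 \approx \tfrac{1}{2}(0.31) - 0.69 = -0.535 .
```

So in this form a negative q can only arise if the Λ term outweighs half the matter term.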

The CMB observations show a first main peak in the power spectrum, from which 
a length scale at the time of the last scattering of the CMB can be inferred. Expansion 
will then fix the corresponding length scale for a peak in the distribution of galaxy- 
galaxy separations at a later time. This is the BAO measurement, made in surveys of 
galaxies. Taking galaxies in a given band of redshift, around a value z_b say, allows one 
to estimate the evolution between the time of last scattering and z_b and thus the Ω 
values, assuming an FLRW model. 

The BAO peak is a less than 1% deviation from a uniform distribution. Nevertheless 
it can be measured with considerable, albeit model-dependent (Roukema et al. 2015), 
accuracy. The same peak has recently been measured using the Lyα forest in the spectra 
of quasars at redshifts in the range 2.1 < z < 3.5 (Delubac et al. 2015). Using this data 
and a value for the sound horizon obtained from the Planck data leads to a value for 
H at z = 2.34 of 222 ± 7, about 7% (2.5σ) discrepant with a flat ΛCDM cosmological 
model with the best-fit Planck parameters. 
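The size of this discrepancy can be roughly reproduced from the Friedman equation for a flat model. The sketch below is not the analysis of Delubac et al.: it neglects radiation, takes Ω_m = 0.31 as quoted in Section 5.2, and assumes H_0 = 67.8 km/s/Mpc as an illustrative Planck-like value that does not appear in the text.

```python
import math

# ASSUMPTIONS: flat LambdaCDM with radiation neglected; h0 = 67.8 km/s/Mpc is
# an illustrative value not quoted in the text; omega_m = 0.31 is the value
# given in Section 5.2.


def hubble_rate(z, h0=67.8, omega_m=0.31):
    """H(z) in km/s/Mpc for a flat matter plus cosmological-constant model."""
    omega_lambda = 1.0 - omega_m  # flatness assumption
    return h0 * math.sqrt(omega_m * (1.0 + z) ** 3 + omega_lambda)


if __name__ == "__main__":
    model = hubble_rate(2.34)
    measured = 222.0  # Lyman-alpha BAO value quoted in the text
    print(model, (model - measured) / model)
```

With these inputs the model value lies several per cent above the measured 222, the same order as the discrepancy quoted; the exact figure depends on the parameter values adopted.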

The accumulated SNIa data has been compared with the BAO measured by the 
WiggleZ team (Blake et al. 2011) at several redshifts out to z > 0.7 and agrees very 
well; see Figure 4. 

Values for Ω_m and Ω_Λ can be obtained by combining the SNIa, CMB and BAO 
data as shown in Figure 5. 

Further important input comes from weak lensing. The images of distant sources 
are distorted by the light-bending due to matter nearer to us. This distortion may be 
strong, for example causing multiple images of a single lensed galaxy, if the light passes 
very close to the lensing object, but for many cases only distorts what would be a circle 
into an ellipse: typically the ellipticity only changes by about 1%. We do not know the 
actual shape of the lensed object. Nevertheless, statistically one can extract the density 
distribution of the mass causing the lensing. 

The first map of dark matter using weak lensing (with a nice three-dimensional 
picture) was given by Massey et al. (2007), in a 1.6 square degree area. The more 
recent data, with a method complementary to the statistical calculations (Van Waerbeke 
et al. 2013), shows detailed maps of the (dark and visible) matter in regions several 
degrees across. Several lensing surveys are in progress to improve this data to the level 
where it may also be able to give additional information on dark energy. Lensing has 
also been detected in the CMB measurements (Ade et al. 2014). 

5.2. The standard model and the development of structure 

The current standard model of cosmology is an expanding approximately FLRW Big 
Bang model, with, taking the values given by the Planck data (Ade et al. 2014), 
Ω_m ≈ 0.31, of which about 0.05 is baryonic matter and the rest dark matter, and 
Ω_Λ ≈ 0.69. It is thus known as the ΛCDM model. These Ω values do not require 
revision of our understanding of thermal history, since Ω_Λ is subdominant for z > 1. 

Figure 4. [Caption from Blake et al. (2011)] Comparison of the accuracy with which 
supernovae and baryon acoustic oscillations map out the cosmic distance scale at 
z < 0.8. For the purposes of this Figure, BAO measurements of D_V(z) have been 
converted into D_A(z) assuming a Hubble parameter H(z) for a flat ΛCDM model 
with Ω_m = 0.29 and h = 0.69, indicated by the solid line in the Figure, and SNe 
measurements of D_L(z) have been plotted assuming D_A(z) = D_L(z)/(1 + z)^2. 

Figure 5. Constraints (68%, 95% and 99% CL contours) in the (Ω_m0, Ω_Λ0) 
plane from SNIa, BAO and CMB. (From Kowalski et al. (2008).) 

In the early universe, the standard model undergoes a period of inflation, with a 
rapid expansion driven by an inflaton, a field not (so far) observed terrestrially: exact 
details are model dependent. The crucial role of this early period is that it leads 
to a spectrum of classical density perturbations that is almost flat. The inflaton is 
typically modeled using a scalar field. This can give rise to a range of behaviours for 
a(t) depending on V(φ). 

The thermal history of this model is described in Durrer (2015) and will not be 
repeated here: it includes the nucleogenesis and other processes on which expansion 
exerts a major influence as described above. 

The development from the initial fluctuations to the last scattering surface and later 
times is modelled by perturbations of the FLRW background. This simple statement 
hides two problems, the choice of the FLRW model to perturb (the “fitting problem”), 
and the “gauge” issue which arises from the lack of an invariantly-defined map between 
the real lumpy Universe M and the fictitious smooth FLRW universe M'. 

The gauge issue is often described in coordinate terms, and then results in changes 
of variables similar to those of gauge transformations in field theory: hence the name. 
It is usual to identify points p ∈ M and p' ∈ M' which have the same coordinates, say 
x^a = x'^a. Making the same coordinate change in each manifold does not change this 
identification. If one slightly alters the map, i.e. the choice of p' for a given p, this can 
be described as mapping x'^a to x^a = x'^a + δx^a, a change of gauge. Since this is clearly 
not a physical change in the real universe, one wants a description independent of the 
gauge choice. 

Stewart & Walker (1974) showed that the only gauge invariant quantities are those 
which in the background are zero, constant scalars, or sums of multiples of the tensor 
δ^i_j. This is a nuisance, and in particular implies that there is no gauge-invariant way to 
define the perturbation of density in an FLRW model, since density is a time-dependent 
scalar. 

The fitting problem of picking the best-fit FLRW model for the comparison also 
involves gauge. The perturbed model could, for example, have greater average density 
than the initial model and so demand a different best fit. Fitting is also closely related 
to the “averaging problem” which is as follows. The equations (2)-(3) are nonlinear, and 
hence the averaged curvature is not the curvature of the averaged metric or averaged 
connection (because the average of a product, <ab> say, is not the product of the 
averages <a><b>). This implies that the averaged density of the real universe 
may not be the same as that implied by (6) for the FLRW model whose average H 
agrees with that of the real universe. One can try to estimate the effect by calculating 
the difference between the average of the curvature and the curvature of the averaged 
metric, the “backreaction”. 
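The non-commutation of averaging with nonlinear operations is easy to see numerically. The toy below only illustrates the algebraic point that <ab> differs from <a><b>; it is not a model of cosmological backreaction, and the function names are hypothetical.

```python
def mean(values):
    return sum(values) / len(values)


def averaging_mismatch(a, b):
    """Return <ab> - <a><b> for two equally long lists of samples."""
    if len(a) != len(b):
        raise ValueError("samples must have the same length")
    mean_of_product = mean([x * y for x, y in zip(a, b)])
    product_of_means = mean(a) * mean(b)
    return mean_of_product - product_of_means


if __name__ == "__main__":
    # Correlated fluctuations: the average of the product exceeds the
    # product of the averages.
    print(averaging_mismatch([1.0, 2.0, 3.0], [1.0, 2.0, 3.0]))
    # A quantity with no fluctuations gives no mismatch.
    print(averaging_mismatch([2.0, 2.0, 2.0], [5.0, 1.0, 3.0]))
```

The mismatch vanishes only when one factor is uniform or the fluctuations are uncorrelated, which is why a smooth best-fit model can misstate averaged nonlinear quantities of a lumpy universe.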

Unfortunately we do not yet have a generally-agreed way to do this. The technical 
difficulties include the need to compare tensorial quantities at separated points, and 
the necessity of defining averaging volumes in a non-covariant way (since Lorentz 
transformations map a point at a finite distance to points on a hyperbola stretching 
to infinity). The various attempts are reviewed by Krasinski (1997, chapter 8) and Ellis 
et al. (2012, chapter 16), q.v. 

A further difficulty is that a best-fit RW metric (i.e. of form (4)) might not show 
the time evolution of an FLRW model with reasonable matter content. 

One approach much explored, because only averaging of scalars is required and 
this is well-defined, is that of Buchert: it involves averaging for fundamental observers 
and the non-commutation of the time derivative and averaging operators, leading to 
modified Friedman and Raychaudhuri equations (see e.g. Buchert (2008)). Because 
tensorial quantities are not averaged, this approach has to be completed by adding 
physical closure conditions. 

There are two main approaches to calculating the behaviour of perturbations, the 
metric-based method of Bardeen (1980) which develops the ideas of Lifshitz (1946) into 
a gauge-invariant treatment, and the covariant approach of Ellis & Bruni (1989) which 
developed ideas of Hawking (1966). There are quite a large number of papers where 
these are expounded and applied: these are surveyed and summarized in Ellis et al. 
(2012, chapter 10). A proper exposition is too lengthy to include here: I shall mention 
only the significance of expansion in the results. 

The role of the expansion rate in the evolution of perturbations arises because 
when perturbations have a length scale much greater than the contemporaneous Hubble 
scale, c/H, they are “frozen in”. This allows long wavelength perturbations to be tracked 
readily through the transitions from inflation through radiation domination to matter 
domination. (The Hubble scale has often been referred to in this context as the Hubble 
horizon, although it does not agree with any causal horizon. In a Big Bang the particle 
horizon at time t_0 is at comoving radial coordinate distance 

\[ u = \int_0^{t_0} \frac{dt}{a(t)} \]

which in the de Sitter metric in its expanding form, (9), is bounded above by 1/H. 
Although (9) is not really a Big Bang model, since it contains no matter, it appeared 
as the approximate metric for the early universe in the initial models of inflation, and 
this may be why the Hubble scale became called a horizon.) 
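The bound quoted for the de Sitter case can be checked directly. Equation (9) is not reproduced in this section; the sketch assumes that its expanding form has scale factor a(t) = e^{Ht} with constant H, in units with c = 1:

```latex
u = \int_0^{t_0} \frac{dt}{a(t)}
  = \int_0^{t_0} e^{-Ht}\, dt
  = \frac{1 - e^{-H t_0}}{H}
  < \frac{1}{H}.
```

The comoving distance therefore approaches 1/H from below as t_0 grows, without ever reaching it.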

As discussed in Durrer (2015), while the baryons and photons are tightly coupled 
the whole content undergoes acoustic oscillations under the competing effects of 
gravitational collapse and radiation pressure. These resulting waves travel at the sound 
speed of this medium, but become frozen in at decoupling, whence the peaks and troughs 
observed in the CMB power spectrum. 

The perturbation theory gives excellent agreement with the CMB fluctuations, but 
those observations only test scales above 150 Mpc. It is clear that nonlinear effects 
become important at smaller scales. Theories of galaxy formation and early evolution 
begin with collapse of a gas cloud, like star formation only larger, and then proceed 
by accretion and merger. The many theoretical inputs have been tabulated by Scott 
(2011). 

The main tool used to investigate the nonlinear phase of structure formation, 
beyond nonlinear perturbation theory, is large Newtonian N-body simulations. These 
can produce the principal features of the actual distributions of galaxies, walls, filaments, 
voids etc. However, they are not in perfect agreement with observation quantitatively 
and have a shaky theoretical basis, since they are non-relativistic. Their results 
do, for example, agree with the relativistic perturbation theory of FLRW models in 
the conformal Newtonian gauge (Chisari & Zaldarriaga 2011), but a recent paper 
has discussed a situation where inhomogeneities that are “easy to describe using the 
linearized general relativity” lead to a model that “taken as a whole lies in fact in the 
nonlinear regime” (Korzynski 2014). 

In a ΛCDM model, the first structures to form are small, suggesting that massive 
galaxies form by multiple mergers. It is possible to use a “fossil record” of space- and 
time-resolved star formation histories, and so show that massive galaxies grow their 
mass from the inside out. The spheroidal parts seem to grow mainly about 5-7 Gyr ago. 
Earlier galaxies do not have the typical virialized structures we see today (ellipticals 
or spirals) and have now been seen (using the Hubble Space Telescope) to undergo the 
mergers expected. Each merger appears to take about 0.5 Gyr and a massive galaxy 
will have undergone 4-5 by today. 

There is growing evidence of correlations between the masses and sizes of galaxies 
(in particular their spheroidal parts) and the masses of their central supermassive black 
holes (SMBH); see for example Kormendy & Ho (2013). How the SMBH controls the 
growth of a galaxy, given that it is typically only a thousandth of the total mass, is not 
yet entirely clear: it may be via the jets produced by the SMBH, fuelled by matter from 
an accretion disk, heating the surrounding gas and so controlling its collapse. What is 
clear is that this is highly nonlinear and another important cosmological application of 
general relativity. 

5.3. Other models 

Models other than FLRW can provide important cosmological information in several 
ways. They test whether features of FLRW models are peculiar to those models, and 
whether the models are robust under perturbations of parameters, and they admit a wide 
variety of potentially observable new effects. They may allow the nonlinear modelling 
of structures at a level which FLRW perturbation theory cannot address. 

The spatially homogeneous but anisotropic models, especially the expanding Big 
Bang cases, were extensively investigated from the 1960s onwards. They can be classified 
into the Bianchi types by their symmetry groups. The singularities that occur can be 
quite complicated, showing the oscillatory behaviour found by Misner (1969), and the 
more recently discovered “Mussel attractor” (Coley & Hervik 2005). Over time, good 
choices of variables have been found for the systems of ordinary differential equations 
that arise, enabling very detailed and full studies (Wainwright & Ellis (eds) 1997, Ellis 
et al. 2012). Most of the dynamics can be understood qualitatively by patching together 
segments of the evolutions of the Bianchi I and II examples. 

Two areas of particular interest have been the approach to the singularity and the 
behaviour as t → ∞. These are relevant to our understanding of the real Universe 
inasmuch as there are Bianchi models arbitrarily close to FLRW models for any given 
accuracy of approximation and length of time, and those models can differ radically 
from FLRW models at early and late times. However, a positive cosmological constant 
tends to isotropize all Bianchi models as t → ∞ (Wald 1983). Nucleosynthesis, the 
CMB observations, inflation, horizons and various quantum gravity theories have also 
been studied in Bianchi models. 

Among inhomogeneous models, the LTB models, the more general spherically 
symmetric models which were also first considered by Lemaitre, and the Szekeres models 
(Szekeres 1975), have been the most extensively explored. Inhomogeneous spacetimes 
can model over- and under-densities, nonlinear gravitational waves, and anisotropic and 
inhomogeneous initial conditions, and some fit the SNIa and CMB data surprisingly 
well (for examples, see Ellis et al. (2012, chapter 19)), showing one should not too 
readily adopt the FLRW explanations. Inhomogeneities may help explain the apparent 
acceleration, as described below, but they also provide ways of making detailed nonlinear 
models of localized structures, such as the Local Group, M87, the Great Attractor 
and so on (Bolejko et al. 2010). In modeling voids and clusters, it was found that 
velocity perturbations were more effective in producing structure than the usual density 
perturbations alone. 

It is sometimes said that inflation can explain the observed homogeneity and 
isotropy, despite the fact that almost all inflationary models start by assuming it, at 
least for the observable region. Calculations in specific models (summarized in Ellis 
et al. (2012, chapters 18-19)) suggest this is not so, and more needs to be done by 
combining non-FLRW geometries and varying forms of inflation to determine the true 
position. So far it seems that anisotropy and inhomogeneity may suppress inflation, 
though there is also a model in which averaged inhomogeneities act as the inflaton 
(Buchert & Obadia 2011). 

6. Open questions, and possible future developments 

The standard model as we now have it has three big obvious unknowns in the natures of 
the inflaton, the dark matter and the dark energy. There are a number of terrestrial dark 
matter searches in progress and it is possible one of them will resolve that issue. I do not 
know of current experiments that could lead us to a fuller understanding of the inflaton: 
one would like to know in more detail how it governs the early universe dynamics and 
the generation of fluctuations, and the interactions by which its energy-momentum is 
converted to present-day matter. 

For the apparent acceleration of the expansion, a number of possible explanations 
have been put forward. While there may be astrophysical effects requiring amendment 
of our calibrations of the supernovae, and our understanding of absorption between them 
and us, which could alter the inferred m-z relation, this seems increasingly unlikely as 
other, albeit still model-dependent, evidence for acceleration (from BAO or gamma ray 
bursters, for example) accumulates. Confidence in FLRW models, in which the various 
ways to define acceleration (Bolejko & Andersson 2008) agree due to the symmetry, thus 
leads to some form of dark energy being necessary. 

There are four possible causes of the apparent acceleration other than the simple 
cosmological constant currently used in modeling. One is a previously unknown quantum 
field (“quintessence”) with some time dependence: observations currently under way aim 
at limiting the possible time development of such fields (assuming an FLRW model). 
Both large and small scale anisotropies might provide explanations, the former by using 
a non-FLRW geometry and the latter both by effects on light propagation and by the 
backreaction. Lastly, since the inferences depend on the relativistic FLRW models, a 
modified gravity theory may be indicated. Of these the most popular explanations are 
the first and last†. 

Small scale anisotropies do affect measurements of m-z both by their effect on light 
propagation and by the backreaction corrections to (6). However, the effects on light 
propagation, e.g. on galaxy number counts (Bertacca et al. 2014), are probably only at 
the levels of accuracy claimed by “precision cosmology” (Ellis et al. 2012, chapter 15). 
The possible models of backreaction by small scale inhomogeneities were put in doubt by 
the work of Green & Wald (2011), but may still provide a possible, and to me appealing, 
explanation of the apparent acceleration (Roukema et al. 2013, Korzynski 2014). 

Large scale inhomogeneities could lead to an apparent acceleration. Since 
observations cannot directly separate spatial from temporal variations, a spatial 
variation could account for the observations, and a number of models on these lines 
have been devised (see Ellis et al. (2012, chapter 15) for a review). However, it is not 
easy to fit all the phenomena, especially those which combine data from various z. For 
example, Bull et al. (2012) show that LTB models cannot simultaneously explain the 
SNIa results, the BAO, the local value of H_0 and the kinematic Sunyaev-Zeldovich effect 
that arises (see Birkinshaw (1999)) when the observed galaxies are in motion relative to 
a frame in which the CMB is isotropic. 

There are a great many projects under way to refine the present data from the CMB, 
SNIa, BAO, lensing, kinematic Sunyaev-Zeldovich effect, and other sources already 
described above. For example, there are at least 9 aimed at constraining the equation 
of state of dark energy (is it the w = -1 of the cosmological constant?), there are a 
number of experiments aimed at measuring B-mode polarizations and there are several 
terrestrial dark matter searches. Most of these involve very delicate measurements: for 
example, a 10% difference in w from -1 implies a change of only 0.04 magnitudes in an 
SNIa at z = 0.6. 
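
The size of this effect can be checked with a short numerical sketch. The code below is an illustration only, not the calculation behind the figure quoted above: it assumes a spatially flat FLRW model with a constant equation of state w and a matter density parameter of 0.3 (these choices, and all function names, are assumptions made here for the example). It compares the luminosity distance at z = 0.6 for w = -0.9 with that for w = -1; the Hubble constant cancels in the ratio.

```python
import math


def inverse_e(z, omega_m, w):
    """1/E(z) for a flat model with matter and constant-w dark energy."""
    omega_de = 1.0 - omega_m  # flatness assumed
    e_squared = (omega_m * (1.0 + z) ** 3
                 + omega_de * (1.0 + z) ** (3.0 * (1.0 + w)))
    return 1.0 / math.sqrt(e_squared)


def luminosity_distance_h0_units(z, omega_m, w, steps=1000):
    """Luminosity distance in units of c/H0 (composite Simpson rule, even steps)."""
    h = z / steps
    total = inverse_e(0.0, omega_m, w) + inverse_e(z, omega_m, w)
    for i in range(1, steps):
        weight = 4.0 if i % 2 else 2.0
        total += weight * inverse_e(i * h, omega_m, w)
    comoving = total * h / 3.0
    return (1.0 + z) * comoving


def magnitude_shift(z, omega_m, w, w_ref=-1.0):
    """Change in apparent magnitude relative to the w_ref model at redshift z."""
    d = luminosity_distance_h0_units(z, omega_m, w)
    d_ref = luminosity_distance_h0_units(z, omega_m, w_ref)
    return 5.0 * math.log10(d / d_ref)


# Assumed illustrative values: omega_m = 0.3, w shifted by 10% from -1.
delta_m = magnitude_shift(0.6, 0.3, -0.9)
print(f"magnitude shift at z = 0.6: {delta_m:+.3f}")
```

Under these assumptions the shift comes out at a few hundredths of a magnitude, the same order as the 0.04 quoted, which is why the supernova photometry has to be controlled so tightly.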

† As shown by M. Fairbairn’s count that of the 591 papers submitted to the online arXiv in 2012 
discussing dark energy, 287 concerned modified gravity theories. 

One further new window related to general relativity may be provided by 
gravitational waves (see the remarks in Durrer (2015) on the BICEP2 results). The 
best current evidence for the existence of such waves is provided by the very detailed 
measurements of the binary and double pulsars, where the period changes agree well 
with the expected energy loss through gravitational radiation. In the near future, the 
ground-based laser interferometric detectors, with their recently improved sensitivity, 
will give interesting results whether they make a detection or not. (Since the predicted 
emission of expected sources should be detectable, not seeing anything would cause a 
re-evaluation of our theoretical understanding.) Pulsar timing arrays (McLaughlin 2014) 
are developing to the point of being very effective ways to detect low frequency waves. 
There is still hope that the space-based interferometric detector LISA will fly within 
some of the readers’ lifetimes. 

In the seminar I referred to in the introduction, Sciama explained how the 1960s 
data favoured the Big Bang theory, and described the relevant FLRW models. We now 
have a great deal more than one known piece of cosmological data, though it is perhaps 
disappointing that that piece, i.e. the value of H_0, is still rather imprecisely known, 
as the divergent values at the end of section 3 show. Even more can be confidently 
anticipated. However, it is far from clear whether or when the big open questions just 
mentioned will be settled. It could even be that evidence emerges forcing us to replace 
general relativistic dynamics for the Universe. What seems likely is that expansion will 
continue to play a big role in our models. 